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h IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Application of: Quertermous et al. 



Serial No. : To Be Assigned 

Filed: Concurrently with Rule 53(b) 
Divisional Application of Serial 
No. 08/659,235 

For: DEVELOPMENTALLY- 

REGULATED ENDOTHELIAL 
CELL LOCUS- 1 



Group Art Unit: To Be Assigned 

Examiner: To Be Assigned 

Attorney Docket No.: 
8907-078-999 



PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Prior to the examination of the above-identified application, please consider the 
following amendments and remarks. 



IN THE TITLE 

At page (i), line 1, and at page 1, line 3, please delete the title and insert the 
following in its place: 

-ANTIBODIES BINDING TO POLYPEPTIDES ENCODED BY 
DEVELOPMENT ALLY-REGULATED ENDOTHELIAL CELL LOCUS-1- 
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IN THE SPECIFICATION 

At page ii, lines 27-31, please delete the paragraph on priority information and 
insert the following in its place: 

-This is a divisional application of United States Patent Application Serial No. 
08/659,235, filed June 5, 1996 (allowed), which is a continuation-in-part of United States 
Patent Application Serial No. 08/480,229, filed June 7, 1995 (allowed), each of which is 
incorporated herein in its entirety.— 



IN THE CLAIMS 

Please cancel Claims 21-27 and 33 without prejudice. 
Please add the following new claims: 

-34. (New) An antibody which binds to a polypeptide, said polypeptide 
comprising the amino acid sequence as shown in SEQ ID NO: 10, or a fragment of the 
antibody that binds said polypeptide. 

35. (New) The antibody of Claim 34 which specifically binds to a structurally 
unique epitope in the polypeptide. 

36. (New) The antibody of Claim 34 which is a Fab, F(ab') 2 , single chain or 
chimeric. 

37. (New) The antibody of Claim 34 which is a polyclonal antibody. 

38. (New) The antibody of Claim 34 which is a monoclonal antibody. 

39. (New) The antibody of Claim 34 which competitively inhibits the binding of 
said polypeptide to alpha V beta 3 . 

40. (New) A method of detecting a Del- 1 -expressing cell in a cell mixture, 
comprising incubating the cell mixture with the antibody of Claim 34; and detecting an 
antibody-bound cell. 

41 . (New) The method of Claim 40 in which the antibody is linked to a 
detectable label. 
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42. (New) An antibody which binds to a polypeptide, said polypeptide 
comprising the amino acid sequence as shown in SEQ ID NO: 29, or a fragment of the 
antibody that binds said polypeptide. 

43 . (New) The antibody of Claim 42 which specifically binds to a structurally 
unique epitope in the polypeptide. 

44. (New) The antibody of Claim 42 which is a Fab, F(ab') 2 , single chain or 
chimeric. 

45. (New) The antibody of Claim 42 which is a polyclonal antibody. 

46. (New) The antibody of Claim 42 which is a monoclonal antibody. 

47. (New) The antibody of Claim 42 which competitively inhibits the binding of 
said polypeptide to alpha V beta 3 . 

48. (New) A method of detecting a Del- 1 -expressing cell in a cell mixture, 
comprising incubating the cell mixture with the antibody of Claim 42; and detecting an 
antibody -bound cell. 

49. (New) The method of Claim 48 in which the antibody is linked to a 
detectable label. 

50. (New) An antibody which binds to a polypeptide, said polypeptide 
comprising the amino acid sequence as shown in SEQ ID NO: 14 from residues #33 through 
#513, or a fragment of the antibody that binds said polypeptide. 

51. (New) The antibody of Claim 50 which specifically binds to a structurally 
unique epitope in the polypeptide. 

52. (New) The antibody of Claim 50 which is a Fab, F(ab f ) 2 , single chain or 
chimeric. 

53. (New) The antibody of Claim 50 which is a polyclonal antibody. 

54. (New) The antibody of Claim 50 which is a monoclonal antibody. 

55. (New) The antibody of Claim 50 which competitively inhibits the binding of 
said polypeptide to alpha V beta 3 . 
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56. (New) A method of detecting a Del- 1 -expressing cell in a cell mixture, 
comprising incubating the cell mixture with the antibody of Claim 50; and detecting an 
antibody-bound cell. 

57. (New) The method of Claim 56 in which the antibody is linked to a 
detectable label. 

58. (New) An antibody which binds to a polypeptide, said polypeptide 
comprising the amino acid sequence as shown in SEQ ID NO: 14 from residues #33 through 
#513 with the exception that the amino acid sequence from residues #98 through #107 is 
removed. 

59. (New) The antibody of Claim 58 which specifically binds to a structurally 
unique epitope in the polypeptide. 

60. (New) The antibody of Claim 58 which is a Fab, F(ab')2, single chain or 
chimeric. 

61. (New) The antibody of Claim 58 which is a polyclonal antibody. 

62. (New) The antibody of Claim 58 which is a monoclonal antibody. 

63. (New) The antibody of Claim 58 which competitively inhibits the binding of 
said polypeptide to alpha V beta 3 . 

64. (New) A method of detecting a Del- 1 -expressing cell in a cell mixture, 
comprising incubating the cell mixture with the antibody of Claim 58; and detecting an 
antibody-bound cell. 

65. (New) The method of Claim 64 in which the antibody is linked to a 
detectable label.— 



REMARKS 

This Preliminary Amendment is being submitted concurrently with Applicants' 
Request for Filing a Divisional under 37 C.F.R. § 1.53(b) of parent Application Serial No. 
08/659,235, filed June 5, 1996. In this Amendment, Claims 21-27 and 33 have been 
cancelled, and replaced by new Claims 34-65. Claims 21-27 and 33 were encompassed in 
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Group IV in a Restriction Requirement dated February 11, 1997 in connection with the 
parent application. The new claims are supported by the specification at page 36, line 28 to 
page 39, line 2. More specifically, the polypeptide recited in Claim 34 is described in 
Figure 3A-3E. The polypeptide recited in Claim 42 is described in Figure 12A-12E. The 
polypeptide recited in Claim 50 is described in Figure 4A-4C. The polypeptide recited in 
Claim 58 is described at page 49, lines 23-27 and in Figure 11. Claims 36-38, 44-46, 52- 
54 and 60-62 are described at page 36, lines 30-33, and page 38, lines 1-11 and lines 28- 
37. Claims 39, 47, 55 and 63 are described at page 25, lines 32-34. Claims 40, 48, 56 and 
64 are described at page 25, line 36 to page 26, line 2. No new matter has been introduced 
by the new claims. 

The title of the application has been amended to reflect the presently claimed 
invention. The specification has been amended to insert up-dated priority information. 
The amendments do not introduce new matter. 

Applicants respectfully request that this Amendment be made of record in the file 
history of the instant application. 



Respectfully submitted, 



Dated: 





Enclosure 



PENNIE & EDMONDS LLP 

1155 Avenue of the Americas 
New York, New York 10036-2711 
(212) 790-9090 
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DEVSLOPMENTALLY- REGULATED ENDOTHELIAL CELL LOCUS - 1 

5 This invention was made, in part, with government 

support under HD 25580 awarded by the National Institutes of 
Health. The government may have certain rights in the 
invention . 

10 1. INTRODUCTION 

The Dresent invention relates to a member of a novel 
gene family referred to as development ally -regulated 
endothelial cell locus - 1 ( del - 1 ) , In particular, the 
invention relates to del -1 nucleotide sequences, Del - 1 amino 

15 acid sequences, methods of expressing a functional gene 
product, antibodies specific for the gene product, and 
methods of using the gene and gene product. Since del -1 is 
expressed in endothelial cells and certain cancer cells, it 
may be useful as an endothelial cell and tumor marker. In 

20 addition, the ability of Del-1 protein to inhibit vascular 
formation provides for its use as an ant i- angiogenic agent. 



2 . 3ACKGR0UND OF THE INVENTION 

2.1. ENDOTHELIAL CELL BIOLOGY AND 
BLOOD VESSEL DEVELOPMENT 

The endothelium occupies a pivotal position at the 
interface between the circulating humoral and cellular 
elements of the blood, and the solid tissues which constitute 
the various organs. In this unique position, endothelial 
cells regulate a large number of critical processes. Such 
processes include leukocyte adherence and transit through the 
blood vessel wall, local control of blood vessel tone, 
modulation of the immune response, the balance between 
thrombosis and thrombolysis , and new blood vessel development 
(-Bevilacqua et al . , 1393, J". Clin. Invest 91:379-337; Folkman 
et al., 1987, Science 235:442-447; Folkman et al . , 1992, J. 
Biol. Chen. 267:10931-10934; Gimbrone , 1935, Churchill 
Livingstone, London; Issekutz , 1992, Curr . Opin . Immunol. 
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4:287-293; Janssens et al . , 1992, J. Biol. Chem. 257:14519- 

14522; Lamas ec al . , 1992, Proc . Nazi. Acad. Sci . U.S.A. 

89:6348-6352; Luscher en al . , 1992, Hypertension 19:117-130; 

Williams et al . , 1992, Am. Rev. Respir. Dis. 146:S45-S50; 
5 Yanagisawa, et al. , 1988, Nature 332:411-415). 

Endothelial cell dysfunction has been postulated as a 

central feature of vascular diseases such as hypertension and 

atherosclerosis. In this context, the ability of the 

endothelium to synthesize smooth muscle cell mitogens and 
10 factors which control smooth muscle contraction has received 

much attention (Janssens ec al . , 1992, J. Biol. Chem. 

267:14519-14522; Lamas et al . , 1992, Proc. Natl. Acad. Sci. 
ifl U.S.A. 89:6348-6352; Luscher et al . , 1992, Hypertension 

! y 19:117-130; Raines ec al . , 1993, Br. Heart J. 69:S30-S37 ; 

**i • 15 Yanagisawa et al., 1988, Nature 332:411-415). The 

endothelial cell has also become che focus of attention in 
M the study of diseases which are not primarily vascular in 

l \ nature . Diverse disease processes such as adult respiratory 

'{2 distress syndrome, septic shock, solid tumor formation, tumor 

m 2 0 cell metastasis, rheumatoid arthritis, and transplant 

3 : m£ 

rejection are now understood to be relaced to normal or 
■ill aberrant function of the endothelial cell. A raoidly 

increasing number of pharmacologic agents are being developed 
whose primary therapeutic action will be co alter endothelial 

2 5 cell function. In addition, recent attention on gene therapy 

has focused on the endothelial cell (Nabel ec al . , 1991, J". 
Am. Coll. Cardiol. 17 : 1S9B-194S) . Transfer of genes into the 
endochelial cell may afford a therapeutic strategy for 
vascular disease, or the endothelium may serve simply as a 

3 0 convenient cellular factory for a missing blood borne factor. 

Hence, information regarding fundamental processes in the 
endothelial cell will aid the understanding of disease 
processes and allow more effective therapeutic strategies. 

Studies from a number of laboratories have characterized 
35 the ability of the endothelial cell to dramatically alter 
basic activities in response to cytokines such as tumor 
3 necrosis factor (TNF) -alpha. TNF -alpha stimulation induces 
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significant alterations in the production of vasoactive 
compounds such as nitric oxide and endothelin, increases 
surface stickiness coward various types of leukocytes, and 
modulates the expression of both pro- and anti-coagulant 
5 factors (Cotran et al . , 1990, J\ Am. Soc . Nephrol. 1:225-235; 
Mantovani et al . , 1992, FASE3 J. 6:2591-2599). In turn, 
endothelial cells have been shown to be an important source 
for the production of cytokines and hormones, including 
interleukin 1, 6 and 8 (Gimbrone et al . , 1989, Science 
10 246:1601-1603; Locksley et al . 1987, J . Immunol. 139:1891- 
1895; Loppnow et al . , 1989, Lymphokine . Res. 8:293-299; 
Warner et al., 1987, J. Immunol. 139:1911-1917). 

The ability of endothelial cells to produce granulocyte, 
granulocyte-macrophage, and macrophage colony stimulating 
15 factors has led to speculation that endothelial cells are an 
important facet of hematopoietic development (Broudy et al . , 
1987, J. Immunol. 139:464-468; Seelentag et al . , 1987, EMBO 
J. 6:2261-2265). Early studies have provided the foundation 
for the cloning of a large number of "endothelial cell- 
2 0 specific" genes. Some of these include I CAM - 1 , I CAM - 2 , VCAM- 
1, 3LAM-1, endothelin-1 , constitutive endothelial cell nitric 
oxide synthetase, thrombomodulin, and the thrombin receptor 
(3evilacqua et al . , 1989, Science 243:1160-1165; Jackman et 
al . , 1986, Proc. Nazi. Acad. Sci . U.S.A. 83:8334-8838; 
25 Janssens et al . , 1992, J. Biol. Chem. 267:14519-14522; Lamas 
et al., 1992, Proc. Natl. Acad. Sci. U.S.A. 8 9:6348-6352; 
Osborn et al . , 1989, Cell 5 9:1203-1211; Staunton et al . , 
1989, Nature 339:61-64; Staunton et al . , 1988, Cell 52:925- 
933; Vu et al, 1991, Cell 64:1057-1068 ; Yanagisawa et al . , 
30 1988, Nature 332:411-415). 

All blood vessels begin their existence as a capillary, 
composed of only endothelial cells. Much of the molecular 
research investigating the role of endothelial cells in blood 
vessel development has focused on this process in the adult 
35 organism, in association with pathological conditions. In 

these situations, new blood vessels are formed by buddina and 
branching of existing vessels. This process, which depends 



on endothelial cell division, has been termed anaioaenesis . 
Research on this process has focused primarily on small 
proceins which are growth factors for endothelial cells 
{Folkman et al . , 1987, Science 235:442-447; Folkman et al . , 

5 1992, J . Biol. Chem. 267:10931-10934). Sensitive bioassays 
for angiogenesis have allowed the characterization of a 
number of angiogenic factors, from both diseased and normal 
tissues. Members of the fibroblast growth factor (FGF) 
family, platelet-derived endothelial cell growth factor, and 

0 vascular endothelial cell growth factor (vascular 

permeability factor) , are a few of the angiogenic factors 
which have been characterized (Folkman et al . , 1987, Science 
235:442-447; Folkman et al . , 1992, J". Biol. Chem, 267:10931- 
10934; Ishikawa et al . , 1989, Nature 338:557-562; Keck et 

5 al., 1989, Science 246:1309-1312; Leung et al . , 1989, Science 
246 : 1306-1309) . 

Such information has provided some insight into the 
study of blood vessel development in the embryo. Studies 
linking vascular development to an angiogenic factor have 
0 resulted in the work with vascular endothelial cell growth 
factor (VEGF) . VEGF expression has been correlated in a 
temporal and spatial fashion with blood vessel development in 
the embryo (3reier et al . , 1992, Development 114:521-532). A 
high affinity VEGF receptor, ilk-1, has been shown to be 
5 expressed on the earliest endothelial cells in a parallel 
fashion (Miliauer et al . , 1993, Cell 72:835-846). 

Blood vessels form by a combination of two orimarv 
processes. Some blood vessel growth depends on angiogenesis, 
in a process very similar to that associated with 
0 pathological conditions in the adult. For instance, the 
central nervous system depends solely on angiogenesis for 
development of its vascular supply (Noden, 1989, Am. Rev. 
Respizr. Dis . 140:1097-1103 ; Risau et al . , 1988, EMBO J . 
7:959-962). A second process, vasculogenesis , depends on the 
incorporation of migratory individual endothelial cells 
(angioblasts ) into the developing blood vessel. These 
angiobiasts appear to be components of almost all mesoderm, 



and are able to migrate in an invasive fashion throuahouc the 
embryo (Coffin ec al . , 1991, Anaz. Rec . 231:383-395 ; Noden, 
1989, Am. r?ev. Respir. Dis . 140:1097-1103 ; Noden, 1991, 
Development 111:867-876) . The precise origin of this cell, 
5 and the characteristics of ins differentiation have not been 
defined . 

Understanding of the molecular basis of endothelial cell 
differentiation in blood vessel development may allow 
manipulation of blood vessel growth for therapeutic benefit. 

0 The ability to suppress blood vessel growth may also provide 
therapeutic strategies for diseases such as solid tumors and 
diabetic retinopathy. On the other hand, diseases such as 
coronary artery disease may be treated through pharmacologic 
induction of directed blood vessel growth, through increasing 

5 collateral circulation in the coronary vascular bed. Both 
vascular diseases such as atherosclerosis and hypertension 
and nonvascular diseases which depend on the endothelial cell 
will benefit from a better understanding of endothelial 
cells . 

0 

2 - 2 . SPIDSRMAL GROWTH FACTOR -LIKE DOMAIN 
Epidermal growth factor (EGF) stimulates growth of a 
variety of cell types. EGF-like domains have been found in a 
large number of extracellular and membrane bound oroteins 

5 (Anderson, 1990, Experiencia 46 ( 1 ) : 2 ; and Doolittle, 1985, 
TIBS, June : 23 3 ) . These proteins include molecules that 
function as soluble secreted proteins, growth factors, 
transmembrane signal and receptor molecules, and components 
of the extracellular matrix (Lawler and Hynes, 1986, J. Cell. 

0 Biol. 103:1635 ; Durkin et al . , 1988, J". Cell Biol. 107:2749; 
Wu et al., 1990, Gene 86:275; Bisgrove and Raff, 1993, 
Develop. Biol. 157:526;). 

In many cases, multiple tandem reoeats of a 
characteristic 40 amino acid long, 6 cysteine-containing 
5 sequence are observed (Anderson, 1990, Ej<perienzia 46(1) :2). 
EGF-like domains are homologous to the peptide growth factor 
EGF which consists of a single copy of the standard EGF 



domain. These domains have been highlv conserved in 
evolution, being found in species as diverse as nematodes, 
Drosophila , sea urchins, and vertebrates . 

The EGF molecule and the closely relaced transforming 
5 growth factor (TGF) alpha induce cell proliferation by 

binding to a tyrosine kinase receptor. It has been suggested 
that other EGF-like domains also function as ligands for 
receptor molecules (Engel, 1989, FEBS Letc. 251:1-7). 
Fundamentally, SGF repeats are protein structures that 
0 participate in specific protein-protein binding interactions. 
The Drosophila Notch protein, the Nematode lin-12 and 
glp-1 proteins, and the closely related vertebrate homologs, 
Motch (mouse Notch) , Xotch (Xenopus Notch) , rat Notch, and 
TAN 1 (human Notch) are membrane bound receptor molecules 
5 that control the specification of cell fate for a variety of 
cell types early in embryogenesis (Rebay et al . , 1991, Cell 
67:687; Hutter and Schnabel , 1994, Development 120:2051; Del 
Amo et al 1992, Development 115:73 7; Reaume et al . 1992 
Develop. Biol. 154:377; and Ellisen et al . , 1991, Cell 
0 66:649). Specific EGF-like repeats in the Notch receptors 
are binding sites that attach to protein ligands leading to 
signal transduction (Rebay et al . , 1991 Cell 67:687; Couso 
and Arias, 1994, Cell 79:259; Fortini and Art avanis -Tsakonas , 
1994, Cell 79:273; Henderson et al . , 1994, Development 
5 120:2913) . Extracellular matrix proteins such as 

thrombospondin, entactin, tenascin and laminin play key roles 
in morphogenesis by providing the physical scaffold to which 
cells attach to form and maintain tissue morphologies 
(Frazier, 1987, J". Cell. Biol. 105:625 ; Taraboletti et al . , 
0 1990, J . Cell. Biol. 111:765; Ekblom et al . , 1994, 
Development 120:2003). 

2 • 3 ■ DISCOIDIN I /FACTOR VIII -LIKE DOMAINS 
A homologous domain structure has been discovered in 
5 coagulation factors VIII and V (Kane and Davie, 1986, Proo . 
Natl. Acad. Sci . U.S.A. 33:6300). This domain is related to 
a more ancient structure first observed in the discoidin I 



r 
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protein produced by the cellular slime mold Dictyostelium 
discoideum. Discoidin I is a carbohydrace binding lectin 
secreted by Dictyostelium cells durinc the orocess o^ 
cellular aggregation and is involved in cell -substratum 
5 attachment: and ordered cell migration (Springer et al . , 1984, 
Cell 3 9 : 557) . 

Discoidin I/factor VIII -like domains have also been 
observed in a number of other proteins. For example, milk 
fat globule protein (BA4 6) , milk fat globule membrane protein 
10 (MFG-E3 ) , breast cell carcinoma discoidin domain receptor 
(DDR) , and the Xenopus neuronal recognition molecule (A5) 
(Stubbs et al., 1990, Proc . Nazi. Acad. Sci . U.S.A. 87:8417; 
Larocca et al . , 1991, Cancer Res. 51:4994; Johnson et al . , 
1993, Proc. Natl. Acad. Sci. U.S.A. 90:5677). The discoidin 
15 I/factor VIII -like domains of the vertebrate proteins are all 
distantly related to the Dictyostelium sequence but more 
closely related to each other. 

Discoidin I/factor VIII -like domains are rich in 
positively charged basic amino acids and are believed to bind 
20 to negatively charged substrates such as anionic 

phospholipids or proteoglycans. 3oth of the milk fat globule 
proteins have been shown to associate closely with cell 

mi 

membranes and the coagulation factors VIII and V interact 
with specific platelet membrane proteins (Stubbs et al . , 1990 
25 Proc. Nat. Acad. Sci. U.S.A. 87:8417; Larocca et al . , 1991, 
Cancer Res. 51:4 994) 



3 . SUMMARY OF THE INVENTION 

The present invention relates to a novel gene family 
referred to as del-1. In particular, it relates to del-1 
nucleotide sequences, expression vectors containing the 
sequences, genetically-engineered host cells exoressinc del- 
1, Del-l protein, Del-1 mutant polypeptides, methods of 
expressing del-1 and methods of using del-1 and its gene 
product in various normal and disease conditions such as 
cancer . 
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The invention is based, in pare, upon Applicants 7 
isolation of a murine BNA clone (SSQ ID NO: 9), dal-1, and 
its homologous human counterpart (SSQ ID NO: 11) . Structura 
features of the Del - 1 protein are deduced by homology 
5 comparisons with sequences in the Genbank and N3RF-PIR 
databases. The protein is a modular molecule composed of 
repeats of two different sequence motifs which are present i: 
a number of distinct proteins. The two sequence motifs are 
known as the EGF-like domain (SEQ ID NO : 26) and the 
0 discoidin I/factor VHI-Iike domain (SEQ ID NOS : 1-8). Thes« 
domains are defined by characteristic patterns of conserved 
amino acids distributed throughout the molecule at specific 
locations. While Del-i shows certain sequence homology with 
other proteins, it is unique in both its primary sequence anc 
5 its overall structure. In all cases in which EGF-like and 
discoidin I-like domains have been identified, both of these 
structures are always found in extracellular locations. 
Variant forms of Del-1 protein exist, and one form is shown 
herein to be an extracellular matrix protein and is 
associated with the cell surface. The expression pattern of 
dal-1 further indicates that it is involved in endothelial 
cell function. In addition, a number of human tumor cells 
express del -2. Furthermore, host -derived blood vessels that 
traverse the tumor nodule also express del-1. The Del-l 
protein inhibits vascular morphogenesis and binds to aV/33 as 
its cellular receptor. Therefore, a wide variety of uses ar€ 
encompassed by the present invention, including but not 
limited to, the use of Del -I as a tumor marker for cancer 
diagnosis and treatment, the isolation of embryonic 
endothelial cells, the identification of Del-1 bindina 
partners, and the stimulation or inhibition of endothelial 
cell growth and blood vessel formation. 

4 • BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Genomic organization of 42 kb of the murine 

dal-1 locus, as characterized by 
cloning from a Xf ix library constructed 



from the SLM2 75 transgenic mouse, and a 
wildtype 129SV Xfix library. The 
dashed line indicates DNA studied to 
date by zoo bloc and exon trapping. 
The location of the exon identified by 
exon trapping is shown. 

Homology analysis between the deduced amino 
acid sequence of the putative del -2 gene 
(m-dell) (SEQ ID NO: 1) and other proteins 
with "discoidin-like domains . " Identical 
residues are boxed, conserved residues are 
shaded (Geneworks , Incelligenetics , Mountain 
View, CA) . m-del-l sequence (SEQ ID NO: 1) 
was derived from a trapped exon and mouse 
embryo cDNAs . Abbreviations : h-MFG , human 
milk fat globule protein (SEQ ID NO: 2); 
h-FV, human coagulation factor V (SEQ ID NO: 
3); m-FVIII , mouse coagulation factor VIII 
(SEQ ID NO: 4); X-A5bl (SEQ ID NO: 5) and 
X-A5b2 (SEQ ID NO: 6), bl and b2 domains of 
Xenopus neuronal antigen A5 ; dis-I, 
discoidin I (SEQ ID NO: 7) . 

Nucleotide sequence and deduced amino acid 
sequence of murine del -1 cDNA (SEQ ID NO : 
9) . 

Nucleotide sequence and deduced amino acid 
sequence of human del -I cDNA (SEQ ID NO: 
II) . 

Murine del-1 fragment: (SEQ ID NO: 19) used 
as probe for human del-1 cloning and 
Northern blot: analysis. 



murine (m-del-i) (SEQ ID NO: 10) and human 
(h-del-1) (SEQ ID NO: 29) Del-I proteins. 
The EGF-like and discoidin- like domains are 
indicated by r 'egf " and "discoidin, " 
resueccivelv . 

The small rectangles labeled "EG?" show the 
location and relative sizes of the three 
EGF- like domains of Del -1 . These regions of 
the protein are approximately 40 amino acids 
long. Each EGF- like domain contains six 
cysteine residues and additional conserved 
amino acids, distributed in a pattern which 
is highly conserved among proteins that 
contain this common motif. In addition, the 
amino acid sequence RGD occurs in the center 
of the second EGF-like repeat. This 
sequence is found in a variety of 
extracellular matrix proteins and, in some 
cases, it is required for binding to 
integrin proteins. An RGD sequence is 
present in the same position in the second 
SGF-like repeat of MFG-E3. 
The large rectangles on the ricrht side 
represent tandem discoidin I/factor VIII - 
like domains. This protein motif is based 
on a conserved pattern of amino acids 
defined by the homology between the D . 
discoidium discoidin I protein and mammalian 
coagulation factor VIII. 

The 54.2% amino acid homology between human 
Del-1 and MFG-E8 (SEQ ID NO: 21} in the 
tandem discoidin I/factor VIII domains is 
shown. These domains are rich in the basic 
amino acids arginine and lysine. The 5' 
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aomam contains 12 arginines and 12 lysines 
versus 9 acidic residues, while the 3' 
domain contains 8 arginines and 10 lysines 
versus 16 acidic residues. A similar domain 
in the coagulation factor VIII protein is 
believed to bind to negatively charged 
phospholipids on the surface of platelets. 
The MFG-E8 protein has been found to 
associate tightly with milk fat globule 
membranes . 



The predicted amino acid sequence at the 
amino terminus of the human Del-1 protein 
(SEQ ID NO: 22) shows characteristics common 
to signal peptides. The putative signal 
begins with a basic arginine residue and is 
followed by a stretch of 18 amino acids rich 
in hydrophobic residues. Signal peptides 
typically end with a small amino acid such 
as glycine or alanine. In addition, the 
Chou and Fasman algorithm predicts that the 
putative signal sequence is followed by a 
protein turn structure, a feature commonly 
found after signal peptides. The Del-1 
protein is secreted by expressing cells. 

Sequence similarities between the three EGF- 
like domains of Del-1 (SEQ ID NOS : 23-25} 
and homology with the consensus EGF-like 
domain amino acid sequence (SEQ ID NO: 26). 
Also, the amino acid sequence RGD is in the 
center of the second EGF-Iike repeat. This 
sequence is found in a variety of 
extracellular matrix proteins and, in some 
cases is required for binding to integrin 
proteins . An RGD sequence is oresent in the 



same oosition in the second EGF-like reioeat 
of MFC-S8 . 



Human del - 1 splicing variant partial 
sequence (SSQ ID NO: 27) showing the 
variation as compared with the major form. 

Murine del -I truncated minor nucleotide and 
deduced amino acid sequences (SEQ ID NO: 
28) . 

X-gal staining in whole mount and tissue 
sections of embryos from the SLM275 line. 
(13A) Embryo ac 7.5 days pc (headfold stage 
stained as whole mount. X-gal staining is 
seen in cells of the extraembryonic mesoder 
(xm) which will give rise to the yolk sac 
and associated blood islands. 
Abbreviations: ng, neural groove. 
Photographed at 70x. (13B) Section of yolk 
sac blood islands from 8 day pc embryo 
stained as a whole mount with membranes 
intact and subsequently sectioned and 
counter stained , Clusters of round cells in 
the blood islands show X-gal staining 
(arrow) , while mature endothelial cells do 
not stain (open arrowhead) . Photographed a 
400x. (13C) Embryo at 9.5 days pc . 
Prominent X-gai staining (blue-green) is 
seen in the heart and outflow tract (mid- 
portion of embryo) . In addition, the aorta 
(arrowhead) and intervertebral vessels are 
stained. Photographed at approximately 3 Ox 
darkfield illumination. (13D) Section of 
9.5 day embryo showing heart and outflow 
tract. This section indicates that X-gal 
staining in the heart and outflow tract is 
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restricted co the endothelial cells 
(endocardium) . Section was counters tained 
with hematoxylin and eosin , photographed at 
200x. (13E) Embryo at 13.5 days pc, 
5 dissected and X-gal stained as a whole 

mount. At this stage, as confirmed by study 
of tissue sections, endothelial cells lining 
the ventricle (v) and large vessels such as 
the aorta (filled arrowhead) have lost most 

10 of their staining. Staining of the 

endothelial cells of the atrium (a) has 
diminished but is still anoarent in the 
whole mount. Most pronounced at this stage 
is staining in the developing lungs (open 

15 arrowheads) . X-gal staining cells are 

clearly associated with the glandular buds 
of the lung, but it is not possible to 
identify these cells in the whole mount. 
The only non-cardiovascular cells which 

20 exhibit X-gal staining are cells in the 

regions of ossification, such as in the 
proximal ribs shown here. Photographed at 
50x. ( 13F) Embryo at 13.5 days, stained as 
whole mount, sectioned, counter stained with 

25 nuclear fast red. X-gal staining in lung 

tissue shown here is associated with 
endothelial cells, as seen in vascular 
channels cut in transverse (arrow) and 
longitudinal (arrowhead) planes. Staining 

3 0 is not associated with bronchial cells. 

Section was photographed at 4 0 Ox . 
(13G) Cross -sect ion through a valve forming 
in the outflow tract of a 13.5 day embryo. 
Endothelial cells in blood vessel wall are 

3 5 undergoing an epithelial -mesenchymal 

transformation, leadina to formation of the 
valve tissue . Stained cells are seen within 



the rormmg valve structure, indicating that 
these ceils continue no express the del -2 
marker during this phenotypic 

transformation. The embryo was stained as a 
whole mount, sectioned, counterstained with 
nuclear fast red and photographed at 400x . 
(13H) Spiral septal formation in the outflow 
tract of the heart at 9.5 days pc . 
Endothelial cells are undergoing an 
epithelial -mesenchymal transformation , 
becoming mesenchymal in morphology and 
behavior. Endothelial cells continue to 
express the transgene marker for some time 
after this transformation. Section from 
whole mount stained embryo, 2 0 Ox. 

Immunoblott ing employing del -1 transf ected 
yolk sac cells. (14A) Yolk sac YS-B cells 
stably transfected with a eukaryotic 
expression vector encoding the murine major 
form of del-l(+) , or an empty expression 
vector (-) were selected and evaluated as 
pools for expression of Dei - 1 protein. 
Protein was isolated from cells lysed in 
cell lysis buffer (Lysis) or standard 
Laemmli gel loading buffer (Laemmli) , or 
from the extracellular matrix remaining 
after transfected cells were removed from 
the culture dish (ECM) . The dominant band 
corresponds to a molecular weight of 52 
kilodal tons (kDa) . Lower molecular weight 
bands most likely represent protein 
degradation products, although the use of 
alternative translation initiation sites is 
also possible. (143) YS-3 cells were stably 
transfected with the del -1 expression 
construct, or the empty expression plasmid, 



and selected as individual clones. Clones 
expressing del - 1 were selected for varying 
levels of protein production, as assayed by 
western blot analysis of extracellular 
matrix protein. Clone L10 shows the highes 
level of del -I mRNA, clones L13 and L14 hav 
an intermediate amount of message, and a 
negative control clone does not express del 
1 . 



Immunostaining of yolk sac cells. (ISA) 
del-1 transf acted yolk sac cells and the 
extracellular matrix are stained with anti- 
Del -1 antibody. The arrows indicate cell 
membrane staining. (153) Mock- transf ected 
yolk sac cells are not stained with 
antibody . 

Immunostaining of Del-1 in the developing 
bone (vertebral column) of a 13.5 day mouse 
embryo. The laquanae within the bone are 
structures composed of extracellular matrix 
proteins and they are stained for Del-1. 

Immunostaining of human glioma grown in nude 
mice. (17A) tumor cells are stained with 
anti-Del-1 antibody. Polarized staining 
pattern is observed (arrows) . (17B) a blood 
vessel is stained with anti-Del-1 within the 
tumor . 



(I8A) The parental yolk sac cell line Y3-3 
under routine culture conditions. Phase 
contrast, photo lOOx. (133) YS-3 cells 
after 24 hrs on "MATRIGEL" show a pattern of 
vascular morphogenesis. Cells were stained 
with toluidine blue. Brightfield, photo 



4 Ox . (18C) Negative control transfeccancs 
form a vascular network on "MATRIGEL" after 
24 hours. Light: areas represent organized 
cells; photographed under dark field 
illumination at 50x. (1SD) Yolk sac 
transf ectant , clone L10, after 24 hrs on 
"MATRIGEL" shows no evidence of vascular 
formation, cells instead produce numerous 
aggregates. Darkfield illumination, photo 
50x. (18S) Parental yolk sac YS-B cells 
grown on a matrix produced by negative 
control transf ectants make a complex 
structural network. Light: areas represent 
organized cells; photographed under dark 
field illumination at 30x. (IS?) Parental 
YS-3 cells grown on a matrix produced by 
del -1 transf ectants . Cells are forming a 
dense monolayer, with no evidence of 
organization. Photographed under darkfield 
illumination at 30x. (18G) Aggregates of 
negative control transf ected yolk sac cells 
are placed onto polymerized "MATRIGEL" . 
After 24 hrs, cells show sprouting 
angiogenesis . Photographed under phase 
contrast, at lOOx. (1SH) Aggregates of del 
1 transf ected yolk sac clone L10 are placed 
onto polymerized "MATRIGEL" as in 18G. 
Photographed after 24 hrs (lOOx) , these 
cells show no evidence of sprouting. 

The binding of murine recombinant Del-i to 
HUVEC is inhibited by an anti-aV/?3 antibody 
The relative cell number of HUVEC adhered t< 
plates coated with recombinant Del - 1 is 
shown in the presence of various antibodies 



The binding of murine recombinant Dei-i to 
HUVEC is inhibited by RGD peptides. The 
relative ceil number of HUVEC adhered to 
plates coated with recombinant Del - 1 is 
shown in the presence of RGD and RGE 
peptides at 10 /ig/ml. 

Two ideograms illustrating the chromosomal 
position of Pi clone 10043 at 5ql4 . (21A) 
nomenclature for human chromosomes adopted 
from the International System for Human 
Cytogenetic Nomenclature (19S5) . (213) an 
ideogram adopted from Cytogene t . Cell Genet. 
65:206-219 (19 94) which shows the relative 
band positions and arm ratios derived from 
actual chromosome measurements . 

5 . DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a novel family of genes 
herein referred to as del -I. Described below are methods for 
cloning members of this gene family, characteristics of a 
murine member and its human homo log , expression of 
recombinant gene products, and methods of using the gene and 
its gene product. Structurally, members of this gene family 
contain three EGF-like domains and two discoidin I/factor 
VII I -like domains . 

The overall structure of the del -1 molecule is similar 
to the milk fat globule membrane protein (MFG-E8) (Stubbs et 
al . , 1990,. Proc. Natl. Acad. Sci . USA 37:8417). MFG-E8 is 
highly expressed by a large portion of human breast tumors as 
well as by lactating mammary epithelial cells. It consists 
of two tandem EGF-like domains followed by two discoidin 
I/factor VII I -like domains. The function of MFG-E8 is not 
known but it has been shown to associate closely with cell 
membranes and has been investigated as a target for antibody- 
based tumor imaging techniques . The observed association of 
MFG-ES with cell membranes indicates the potential use of 



Figure 20 



Figure 
21A & 21B 



antibodies against Del-1 to identify and sore endothelial 
ceils from mixed cell populations, and to target tumor cells 
that express Del-1 for diagnosis and therapy. 

The second EGF-like repeat of lMFG- E8 contains the amino 
5 acid sequence arg-gly-asp (RGD) in the same position as the 
second EGF-like repeat of Del-1. The RGD sequence has been 
shown to be a cell binding site for fibronectin, discoidin I, 
nidogen/entactin, and tenascin (Anderson, 1990, Experientia 
46:2). The binding of fibronectin to cell surface integrin 

10 molecules through the RGD sequence has been extensively 

studied (Main et al . , 1992, Cell 71:671; Hynes, 1992, Cell 
69:11) . Integrins appear to be the major receptors by which 
cells attach to extracellular matrices. Substrate binding to 
integrins has been shown to initiate signal transduction 

15 leading to events such as tyrosine phosphorylation, 

cytoplasmic alkalinization , activation of secretion and 
differentiation (Hynes , 1992, Cell 69:11). The presence of 
the RGD sequence in Del-1 indicates that this portion of the 
molecule may bind cell surface integrins, possibly triggering 

20 certain developmental events. In particular, Del-1 is shown 
to bind to integrin orV03 on endothelial cells. In several 
cases, synthetic peptides containing the RGD sequence have 
been shown to compete with native protein for integrin 
binding and prevent the initiation of downstream events 

25 (Brooks et al . , 1994, Cell 79:1157). 

For clarity of discussion, the invention is described in 
' the subsections below by way of example for the del-1 genes 
and their products in mice and in humans. However, the 
findings disclosed herein may be analogously applied to other 

30 members of the del -I family in all species. 

5.1. THE DEL-1 CODING SEQUENCE 

The present invention relates to nucleic acid molecules 
and polypeptides of the del-1 gene family. " In a specific 
35 embodiment by way of example in Section 6, infra , murine and 
human del-1 nucleic acid molecules were cloned, and their 
nucleotide and deduced amino acid sequences characterized. 
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3oth the nucleotide coding sequence and deduced amino acid 
sequence of del -I are unique. In accordance with the 
invention, any nucleotide sequence which encodes the amino 
acid sequence of the del -1 gene product can he used to 
5 generate recombinant molecules which direct: the expression of 
del -1 gene . 

Enhancer trapping is a strategy which has been 
successfully employed in genetic analysis in Drosophila but 
is also applicable to higher organisms. This method 
0 identifies regulatory regions in genomic loci through their 
influence on reporter genes (Okane et al . , 1987, Proc . Natl. 
Acad. Sci. U.S.A. 84:9123-9127). The reporter gene, as a 
transcriptional unit under the control of a weak 
constitutively expressed eukaryotic promoter, is introduced 
5 into a large number of organisms. The offsprings of these 
organisms are then screened by analysis of the pattern of 
reporter gene expression. Lines which show expression in the 
appropriate cells at the appropriate time are maintained for 
further study. This strategy has successfully identified a 
0 number of loci in Drosophila involved in complex 
developmental processes . 

Enhancer trap experiments have been employed in mice to 
a limited extent (Allen et al . , 1988, Nature 333:852-855). A 
number of such experiments were through fortuitous 
5 integration of a reporter gene into a locus of interest 
(Kothary et al . , 1988, Nature 335:435-437). Using this 
method coupled with genomic and cDNA cloning, the murine 
del -1 locus associated with the iransgene was identified. A 
genomic library is generated from the transgenic mouse, and a 
0 probe from the transgene used to isolate clones containing 
the transgene and sequences flanking the integration site. 
Characterization of the regulatory region is accomplished by 
employing flanking sequences in functional assays, via 
transfection experiments with an appropriate cell culture 
5 line, or via further transgenic experiments (Bhat et al . , 
1988, Mol. Cell. Biol. 8:3251-3259). 



For analysis of the transcription unit, it: is necessary 
to identify a region of flanking sequence which contains a 
portion of exon. This has been accomplished by blindly using 
flanking genomic sequences as probes in northern blocs or zoo 
5 blots (Soinen et al . , 1992, Mechanisms of Development 39:111- 
123) . DNA fragments thus identified to contain exon sequence 
are employed as probes for cDNA cloning. Similar cloning 
experiments have been conducted to characterize loci 
inactivated by insertions! mutagenesis associated with 
LO transgene integration. These experiments indicate that 
deletions of large regions of genomic DNA may accompany 
transgene integration, and that: complexity of the 
transcription unit may greatly complicate this type of 
analysis (Karls et al . , 1992, Mol . Cell. Biol. 12:3644-3552; 
.5 Woychik et al . , 1990, Nature 346:850-853). 

Subsequent analysis of the del -2 sequence has revealed 
both EGF-like and discoidin I/factor VI II -like domains. The 
shared homology between del -I and other known molecules is 
discussed in Section 6.2, infra. However, this molecule also 
0 contains regions of previously unreported unique nucleotide 
sequences. Northern blot hybridization analysis indicates 
that del -1 mRNA is highly expressed in fetal cells. In 
addition, the del -I sequence is expressed in certain tumor 
cells . 

5 In order to clone the full length cDNA sequence from any 

species encoding the entire del-1 cDNA or to clone variant 
forms of the molecule, labeled DNA probes made from nucleic 
acid fragments corresponding to any murine and human of the 
partial cDNA disclosed herein may be used to screen a cDNA 

0 library. More specifically, oligonucleotides corresponding 
to either the 5' or 3 ' terminus of the cDNA sequence may be 
used co obtain longer nucleotide sequences. Briefly, the 
library may be plated out to yield a maximum of 30,000 pfu 
for each 150 mm plate. Approximately 40 slates may be 

5 screened. The plates are incubated at 37=C until the plaques 
reach a diameter of 0.25 ran or are just beginning to make 
contact with one another (3-8 hours) . Nylon filters are 
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placed onto the soft cop agarose and after 60 seconds, the 
filters are peeled off and floated on a DNA denaturing 
solution consisting of 0.4N sodium hydroxide. The fillers 
are then immersed in neutralizing solution consisting of 1M 
5 Tris HCL, pK 7.5, before being allowed to air dry. The 
filters are prehybridized in casein hybridization buffer 
containing 10% dextran sulfate, 0 . 5M NaCl , SOmJM Tris HCL , pH 
7.5, 0.1% sodium pyrophosphate, 1% casein, 1% SDS, and 
denatured salmon sperm DNA at 0.5 mg/ml for 6 hours at 6 0 °C. 

10 The radiolabelled probe is then denatured by heating to 95 °C 
for 2 minutes and then added to the prehybridization solution 
containing the filters. The filters are hybridized at 60°C 
for 16 hours. The filters are then washed in IX wash mix 
(10X wash mix contains 3M NaCl, 0 . 6M Tris base, and 0 . 02M 

15 SDTA) twice for 5 minutes each at room temperature, then in 
IX wash mix containing 1% SDS at 60 °C for 3 0 minutes, and 
finally in 0 . 3X wash mix containing 0.1% SDS at 60°C for 30 
minutes . The filters are then air dried and exposed to x-ray 
film for autoradiography. After developing, the film is 

20 aligned with the filters to select a oositive oiaaue . If a 
single, isolated positive plaque cannot be obtained, the agar 
plug containing the plaques will be removed and placed in 
lambda dilution buffer containing 0 . 1M NaCl, 0.01M magnesium 
sulfate, 0.035M Tris HC1 , pH 7.5, 0.01% gelatin. The phage 

25 may then be replated and rescreened to obtain single, well 
isolated positive plaques. Positive plaques may be isolated 
and the cDNA clones sequenced using primers based on the 
known cDNA sequence. This step may be repeated until a full 
length cDNA is obtained. 

3 0 It may be necessary to screen multiple cDNA libraries 

from different tissues to obtain a full length cDNA. In the 
event that it is difficult to identify cDNA clones encoding 
the complete 5' terminal coding region, an often encountered 
situation in cDNA cloning, the RACE (Rapid Amplification of 

3 5 cDNA Ends) technique may be used. RACE is a proven ?CR-based 
strategy for amplifying the 5' end of incomplete cDNAs . 5 ' - 
RACE -Ready cDNA synthesized from human fetal liver containing 
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a unique anchor sequence is commercially available 
(Ciontech) . To obtain che 5 ' end of the cDNA, PCR is carried 
out on 5 ' -RACE -Ready cDNA using che provided anchor primer 
and the 3' primer. A secondary ?CR reaction is then carried 

5 out using the anchored primer and a nested 3' primer 

according to the manufacturer's instructions. Once obtained, 
the full length cDNA sequence may be translated into amino 
acid sequence and examined for certain landmarks such as a 
continuous open reading frame flanked by translation 

0 initiation and termination sites, EGF-like domain, 

discoidin I-like domain, a potential signal sequence and 
transmembrane domain, and finally overall structural 
similarity to the del -2 genes disclosed herein. 

5 5.2. EXPRESSION OF DSL - 1 SEQUENCE 

In accordance with the invention, a del-1 polynucleotide 
sequence which encodes the Del-l protein, mutant 
polypeptides, peptide fragments of Del-1, Del-1 fusion 
proteins or functional equivalents thereof, may be used to 
0 generate recombinant DNA molecules that direct the expression 
of Del-l protein, Del-1 peptide fragments, fusion proteins or 
a functional equivalent thereof, in appropriate host cells. 
Such del-1 polynucleotide sequences, as well as other 
polynucleotides which selectively hybridize to at least a 
5 part of such del-l polynucleotides or their complements, may 
also be used in nucleic acid hybridization assays, Southern 
and Northern blot analyses, etc. 

Due to the inherent degeneracy of the genetic code, 
other DNA sequences which encode substantially the same or a 
0 functionally equivalent amino acid sequence, may be used in 
the practice of the invention for the cloning and expression 
of the Del-1 protein. Such DNA sequences include those which 
are capable of hybridizing to the murine and/or human del -I 
sequences under stringent conditions. The phrase "stringent 
conditions" as used herein refers to those hybridizing 
conditions that (1) employ low ionic strength and high 
temperature for washing, for example, 0.015 M NaCl/0.0015 M 



sodium citrate/0.1% SDS ac 50°C; (2) e^loy durina 
hybridization a denaturing agent such as formamide, for 
example, 50% (vol/vol) formamide with 0.1% bovine serum 
albumin/0.1% Ficoll/0 . 1% polyvinylpyrrolidone/5 0 mM sodium 

5 phosphate buffer at pH 6.5 with 750 mM NaCl , 75 mM sodium 

citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M 
NaCl, 0.075 M Sodium pyrophosphate, 5 x Denhardt ' s solution, 
sonicated salmon sperm DNA (50 /xg/ml) , 0.1% SDS, and 10% 
dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC and 

0 0.1% SDS. 

Altered DNA sequences which may be used in accordance 
with the invention include deletions, additions or 
substitutions of different nucleotide residues resulting in a 
sequence that encodes the same or a functionally equivalent 

5 gene product. The gene product itself may contain deletions, 
additions or substitutions of amino acid residues within a 
Del-l sequence, which result in a silent change thus produ- 
cing a functionally equivalent Del-l protein. Such amino 
acid substitutions may be made on the basis of similarity in 

0 polarity, charge, solubility, hydrophobicity , hydrophilicity , 
and/or the amphipathic nature of the residues involved. For 
example, negatively charged amino acids include aspartic acid 
and glutamic acid; positively charged amino acids include 
lysine, histidine and arginine; amino acids with uncharged 

5 polar head groups having similar hydrophilicity values 
include the following: glycine, asparagine, glutamine, 
serine, threonine, tyrosine; and amino acids with nonpolar 
head groups include alanine, valine, isoleucine, leucine, 
phenylalanine, proline, methionine, tryptophan. 

0 The DNA sequences of the invention may be engineered in 

order to alter a del-l coding sequence for a variety of ends, 
including but noc limited to, alterations which modify 
processing and expression of the gene product. For example, 
mutations may be introduced using techniques which are well 

5 known in the art, e.g., site-directed mutagenesis, to insert 
new restriction sites, to alter glycosylat ion patterns, 
phosphorylation, etc . 
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Based on the domain organization of the Del-1 protein, a 
large number of Del-i mucar.: polypeptides can be constructed 
by rearranging the nucleotide sequences that encode the Del-1 
domains. Since the SGF-like domains of Del-1 are known to be 
5 involved in protein binding, Del-1 may directly bind to other 
cell surface receptors or extracellular matrix proteins via 
these domains, thereby controlling ceil fate determination or 
differentiation in a manner similar to Notch and Notch 
ligands . Additionally, the RGD sequence in the second EG? - 

10 like domain is known to bind to certain integrins , thus Del-1 
may regulate cell adhesiveness, migration, differentiation 
and viability via this sequence. The discoidin I -like 
domains of Del-1 are involved in a separate type of ceil 
binding activity. In accordance with the observed properties 

15 of Factors V and VIII, Del-1 may directly bind proteoglycans 
in the extracellular matrix or on the cell surface via those 
domains. Therefore, the combination of various domains of 
full-length Del-1 permits the molecule to perform diverse 
types of binding. For example, the major form of Del-1 may 

2 0 be able to cluster integrin receptors by way of both EG?- like 

and discoidin I-like domains. In contrast, smaller fragments 
of Del-1 or its minor form would bind integrins without the 
ability to induce receptor clustering, and thus induce 
alternative signals to cells. 
25 in view of the foregoing, the Del-1 mutant polypeptides 

can be generated and their functional activities compared. 
In addition to the minor form, Del-l mutants may be 
constructed to contain only the SGF-like or discoidin I-like 
domains. Additionally, smaller polypeptides can be made from 

3 0 constructs that contain any one of the EG? -like and discoidin 

I-like domains. 

In another embodiment of che invention, a del-1 or a 
modified del-1 sequence may be ligated co a heteroloaous 
sequence to encode a fusion protein. For example, for 
35 screening of peptide libraries for molecules that bind Del-1, 
it may be useful to encode a chimeric Del-1 protein 
expressing a heterologous epitope that is recognized by a 



commercially available antibody. A fusion protein may also 
be engineered to contain a cleavage sine located between a 
Del-1 sequence and the heterologous protein sequence, so tha 
the Del-1 may be cleaved away from che heterologous moiety. 

In an alternate embodiment of the invention, the coding 
sequence of Del-1 could be synthesized in whole or in part, 
using chemical methods well known in the art. See, for 
example, Caruthers ec al . , 1980, Nuc . Acids Res. Symp . Ser . 
7:215-233; Crea and Horn, 180, Nuc. Acids Res. 9 (10) i2331; 
Matteucci and Caruthers, 1980, Tetrahedron Letter 21:119; an< 
Chow and Kempe , 1981, Nuc. Acids Res. 9 (12 ): 2807 -2817 . 
Alternatively, the protein itself could be produced using 
chemical methods to synthesize an Del-1 amino acid sequence 
in whole or in part. For example, peptides can be 
synthesized by solid phase techniques, cleaved from the 
resin, and purified by preparative high performance liquid 
chromatography. (e.g., see Creighton, 1983, Proteins 
Structures And Molecular Principles, W.H. Freeman and Co., 
N.Y. pp. 50-60). The composition of the synthetic peptides 
may be confirmed by amino acid analysis or sequencing (e.g., 
the Edman degradation procedure; see Creighton, 1983, 
Proteins, Structures and Molecular Principles , W.H. Freeman 
and Co., N.Y., pp. 34-49) . 

In order to express a biologically active Del-1, the 
nucleotide sequence coding for Del-1, or a functional 
equivalent, is inserted into an appropriate expression 
vector, i.e., a vector which contains the necessary elements 
ror the transcription and translation of the inserted coding 
sequence. The del-1 gene products as well as host cells or 
cell lines transf ected or transformed with recombinant del-1 
expression vectors can be used for a variety of purposes. 
These include but are not limited to generating antibodies 
(i.e., monoclonal or polyclonal) that competitively inhibit 
activity of Del-1 protein and neutralize its activity; and 
antibodies that mimic the activity of Del-1 binding partners 
such as a receptor. Anti-Del -1 antibodies may be used in 
detecting and quantifying expression of Del-1 levels in cells 



and tissues such as endothelial cells and certain rumor 
cells, as well as isolating Del - 1 -positive cells. 

5.3. EXPRESSION SYSTEMS 
5 Methods which are well known to chose skilled in the ar 

can be used to construct expression vectors containing the 
del-! coding sequence and approoriate 

transcriptional/translational control signals. These methods 
include in vitro recombinant DNA techniques, synthetic 
LO techniques and in vivo recombination/genetic recombination. 
See, for example, the techniques described in Sambrook et 
al., 1989, Molecular Cloning A Laboratory Manual, Cold Spring 
Harbor Laboratory, N . Y . and Ausubel et al . , 1989, Current 
Protocols in Molecular Biology, Greene Publishing Associates 
.5 and Wiley Interscience , N.Y. 

A variety of host -expression vector systems may be 
utilized to express the del -J coding sequence. These include 
but are not limited to microorganisms such as bacteria 
transformed with recombinant bacteriophage DNA, plasmid DNA 
0 or cosmid DNA expression vectors containing the del -2 coding 
sequence; yeast transformed with recombinant yeast expression 
vectors containing the del-1 coding sequence; insect cell 
systems infected with recombinant virus expression vectors 
^ e : 3 - r baculovirus) containing the del-1 coding sequence ; 
5 plant cell systems infected with recombinant virus expression 
vectors (e^cN , cauliflower mosaic virus, CaMV; tobacco mosaic 
virus, TMV) or transformed with recombinant plasmid expres- 
sion vectors (e.g. , Ti plasmid) containing the del-1 coding 
sequence; or animal cell systems. The expression elements of 
0 these systems vary in their strength and specificities. 

Depending on the host/vector system utilized, any of a number 
of suitable transcription and translation elements, including 
constitutive and inducible promoters, may be used in the 
expression vector. For example, when cloning in bacterial 
5 systems, inducible promoters such as pL of bacteriophage X, 
plac, ptrp, ptac [ptrp-lac hybrid promoter; cytomegalovirus 
promoter) and the like may be used; when cloning in insect 



cell systems, promoters such as the baculovirus poly hear in 
promoter may be used; when cloning in plane cell systems, 
promoters derived from the genome of plan* cells (e.g. , heat 
shock promoters ; the promoter for the small subunit of 
RU3ISC0; the promoter for the chlorophyll a/j3 binding 
protein) or from plant viruses ( e.g. , the 35S RNA promoter of 
CaMV; the coat protein promoter of TMV) may be used; when 
cloning in mammalian cell systems, promoters derived from the 
genome of mammalian cells ( e.g. , metallothionein promoter) or 
from mammalian viruses ( e.g. , the adenovirus late promoter; 
the vaccinia virus 7.5K promoter) may be used; when 
generating cell lines that contain multiple copies of the 
del ~ 1 DNA, SV40-, 3?V- and E3V-based veccors may be used with 
an appropriate selectable marker. 

In bacterial systems a number of expression vectors may 

-+ mm *h 

be advantageously selected depending upon the use intended 
for the del -1 expressed. For example, when large quantities 
of del - 1 are to be produced for the generation of antibodies 
or to screen peptide libraries, vectors which direct the 
expression of high levels of fusion protein products that are 
readily purified may be desirable. Such vectors include but 
are not limited to the E. coli expression vector pUR278 
(Ruther et al . , 1983, SMBO J . 2:1791), in which the del -I 
coding sequence may be ligated into the vector in frame with 
the la.cZ coding region so that a hybrid AS~la.cZ protein is 
produced; pIN vectors (Inouye & Inouye, 1935, Nucleic acids 
Res. 13:3101-3109; Van Heeke & Schuster, 1989, J. Biol. Chem . 
264:5503-5503); and the like. pGEX vectors may also be used 
to express foreign polypeptides as fusion proteins with 
glutathione S- transferase (GST) . In general, such fusion 
proteins are soluble and can easily be purified from lysed 
cells by adsorption to glutathione -agarose beads followed by 
elution in the presence of free glutathione. The pGEX 
vectors are designed to include thrombin or factor Xa 
protease cleavage sites so thac the cloned polypeptide of 
interest can be released from the GST moiety. In particular, 
murine del -1 major and minor coding sequences have been 



- 27 - 



inserted in pET28a (Novagen Inc.) which contains a 

T7 promoter, and pMALC2 (New England Biolabs) . These vectors 

encode fusion proteins which can be readily ourified. 

In yeast, a number of vectors containing constitutive or 
5 inducible promoters may be used. For a review see, Current 
Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel et 
al . , Greene Publish. Assoc. & Wiley Interscience , Ch. 13; 
Grant et al . , 1987, Expression and Secretion Vectors for 
Yeast, in Methods in Enzymology , Eds. Wu & Grossman, 1987, 
.0 Acad. Press, N.Y., Vol. 153, pp. 516-544; Glover, 1986, DNA 
Cloning, Vol. II, IRL Press, Wash., D.C., Ch . 3; and Bitter, 
1987, Heterologous Gene Expression in Yeast:, Methods in 
Enzymology, Eds. Berger Sc Kimmel, Acad. Press, N.Y., Vol. 
152, pp. 673-684; and The Molecular Biology of the Yeast 
.5 Saccharomyces, 1982, Eds. Strathern et al . , Cold Spring 
Harbor Press, Vols. I and II. 

In cases where plant expression vectors are used, che 
expression of the del -1 coding sequence may be driven by any 
of a number of promoters. For example, viral promoters such 
0 as the 35S RNA and 19S RNA promoters of CaMV (Brisson et al . , 
1984, Nature 310:511-514), or the coat protein promoter of 
TMV (Takamatsu et al . , 1987, SMBO J. 6:307-311) may be used; 
alternatively, plant promoters such as the small sub unit of 
RUB I SCO (Coruzzi et al . , 1984, EM30 J. 3:1571-1680; Broglie 
5 et al., 1984, Science 224:838-843); or heat shock promoters, 
e - ? soybean hsp!7 . 5-E or hsp!7.3-3 (Gurley en al . , 1986, 
Mol . Cell. Biol. 6:559-565) may be used. These constructs 
can be introduced into plant cells using Ti plasmids, Ri 
plasmids, plant virus vectors, direct DNA transformation, 
0 microinjection, electroporation, etc. For reviews of such 
techniques see, for example, Weissbach £ Weissbach, 1933, 
Methods for Plant Molecular Biology, Academic Press, NY, 
Section VIII, pp. 421-453; and Grierson & Corey, 1988, Plant 
Molecular Biology, 2d Ed., Blackie, London," Ch. 7-9. 
5 An alternative expression system which could be used to 

express del -1 is an insect system. In one such system, 
Autographa calif ornica nuclear polyhidrosis virus (AcNPV) is 
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used as a vector to express foreign genes. The virus grows 
in Spodoptera frugipsrda cells. The del-1 coding sequence 
may be cloned into non-essencial regions (for example the 
polyhedrin gene) of the virus and placed under control of an 
5 AcNPV promoter (for example the polyhedrin promoter) 

Successful insertion of the del -I coding sequence will result 
in inacuivation of the polyhedrin gene and production of non- 
occluded recombinant virus ( i.e. , virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 
LO recombinant viruses are then used to infect: Spodoptera 
frugipsrda cells in which the inserted gene is expressed, 
(e^g^, see Smith et al . , 1983, J . Viol. 46:584; Smith, U.S. 
Patent No. 4,215,051). A commercially available baculovirus 
expression vector pFastBac 1 (Gibco BRL , Inc.) has been 
.5 constructed to contain the murine del-1 coding sequence. 
In mammalian host ceils, a number of viral based 
expression systems may be utilized. In cases where an 
adenovirus is used as an expression vector, the del-1 coding 
sequence may be ligated to an adenovirus 
0 transcription/ translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric gene 
may then be inserted in the adenovirus genome by in vitro or 
in vivo recombination. Insertion in a non-essential region 
of the viral genome (e.g., region El or E3) will result in a 
5 recombinant virus that is viable and capable of expressing 
del-1 in infected hosts. (e.g., See Logan Sc Shenk, 1984, 
Proc. Natl. Acad. Sci . USA 81:3655-3559). Alternatively, the 
vaccinia 7 . 5K promoter may be used. (See, e.g. , Mackett et 
al . , 1982,. Proc. Natl. Acad. Sci. USA 79:7415-7419/ Mackett 
0 et al., 1984, J. Virol. 49:857-864; Panicali et al . , 1982, 
Proc. Natl. Acad. Sci. USA 79:4927-4931). 

Additionally, both the murine del-1 and human codina 

— i 

sequences have been inserted in a mammalian expression 
vector, pcDNA3 (Invitrogen, Inc.), which is" under the control 
5 of the cytomegalovirus promoter. Regulatable expression 
vectors such as the tetracycline inducible vectors may also 



be used to express the coding sequences in a controlled 
fashion . 

Specific initiation signals may also be reauired for- 
efficient translation of inserted del -1 coding seauences. 
5 These signals include the ATG initiation codon and adjacent 
sequences. In cases where the entire del-1 gene, including 
its own initiation codon and adjacent sequences, is inserted 
into the appropriate expression vector, no additional 
translational control signals may be needed. However, in 
LO cases where only a portion of the del-1 coding sequence is 
inserted, exogenous translational control signals, including 
the ATG initiation codon, must be provided. Furthermore, the 
initiation codon must be in phase with the reading frame of 
the del-1 coding sequence to ensure translation of the entire 
.5 insert. These exogenous translational control signals and 
initiation codons can be of a variety of origins, both 
natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of appropriate transcription 
enhancer elements, transcription terminators, etc. (see 
0 Bittner et al . , 1987, Methods in Snzymol . 153:516-544). 

In addition, a host cell strain may be chosen which 
modulates the expression of the inserted seauences, or 
modifies and processes the gene product in the specific 
fashion desired. Such modifications ( e.g. , glycosylation) 
5 and processing (e.g. , cleavage) of protein products may be 
important for the function of the protein. The presence of 
several consensus N-glycosylation sites in the del-1 
extracellular domain support the possibility that proper 
modification may be important for Del-1 function. Different 
0 host cells have characteristic and specific mechanisms for 
the posc-translational processing and modification of 
proteins. Appropriate cell lines or host systems can be 
chosen to ensure the correct modification and orocessina of 
che foreign protein expressed. To this end, eukarvotic host 
5 cells which possess the cellular machinery for proper 
processing of the primary transcribe, glycosylation, and 
phosphorylation of the gene product may be used. Such 
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mammalian host cells include but are not: limited to CHO , 
VERO, 3HK, HeLa , COS , MDCK, 293, WI38, yolk sac ceils, etc. 

For long-term , high-yield production of recombinant 
proceins, stable expression is preferred- For example, cell 
lines which stably express the del -1 may be engineered. 
Rather than using expression vectors which contain viral 
origins of replication, host ceils can be transformed with 
the del -1 DNA controlled by appropriate expression control 
elements ( e.g. , promoter, enhancer, sequences, transcript ion- 
terminators, polyadenylation sices, etc.), and a selectable 
marker. Following the introduction of foreign DNA, 
engineered cells may be allowed to grow for 1-2 days in an 
enriched media, and then are switched to a selective media. 
The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form 
foci which in turn can be cloned and expanded into cell 
lines . This method may advantageously be used to engineer 
ceil lines which express the Del - 1 protein on the cell 
surface. Such engineered cell lines are particularly useful 
in screening for molecules or drugs chat affect del ~1 
function . 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase 
(Wigler, et al . , 1977, Cell 11:223), hypoxanthine -guanine 
phosphoribosyltransf erase (Szybalska & Szybalski, 1952, Proc . 
Natl. Acad. Sci . USA 48:2026), and adenine 

phosphoribosyltransf erase (Lowy , et al . , 1980, Cell 22:817) 
genes can be employed in tk", hgprt" or aprt" cells, 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for dhfr, which confers resistance to 
methotrexate (Wigler, et al . , 1980, Natl. Acad. Sci. USA 
77:3567; O ' Hare , et al . , 1981, Proc. Natl. Acad. Sci. USA 
78:1527); opt, which confers resistance to mycophenolic acid 

(Mulligan £ Berg, 1981), Proc. Natl. Acad. Sci. USA 78:2072); 
neo, which confers resistance to the aminoglycoside G-418 

(Colberre-Garapin, et al . , 1981, J. Mol . Biol. 150:1); and 



hygro , which confers resistance to hygromycin (Santerre, et 
al., 1984, Gene 30:147) genes. Recently, additional 
selectable genes have been described, namely trpB , which 
allows cells to utilize indole in olace of tryptophan; hisD 
5 which allows cells to utilize histinol in place of histidine 
(Hartman & Mulligan, 1988, Proc . Natl. Acad. Sci . USA 
85:8047); and ODC (ornithine decarboxylase) which confers 
resistance to the ornithine decarboxylase inhibitor, 2- 
(dif luoromethyl) -DL-ornithine , DFMO (McConlogue L . , 1987, In 
L0 Current Communications in Molecular Biology, Cold Spring 
Harbor Laboratory ed.) . 

5.4. IDENTIFICATION OF CELLS THAT EXPRESS DEL-1 
The host cells which contain the coding sequence and 
.5 which express a biologically active del -I gene product or 
fragments thereof may be identified by at least four general 
approaches; (a) DNA-DNA or DNA-RNA hybridization; (b) the 
presence or absence of "marker" gene functions; (c) assessing 
the level of transcription as measured by the expression of 
0 del -2 mRNA transcripts in the host cell; and (d) detection of 
the gene product as measured by immunoassay or by its 
biological activity. Prior to the identification of gene 
expression, the host cells may be first mutagenized in an 
effort to increase the level of expression of del -I, 
5 especially in cell lines that produce low amounts of del-l. 

In the first approach, the presence of the del-l coding 
sequence inserted in the expression vector can be detected by 
DNA-DNA or DNA-RNA hybridization using probes comprising 
nucleotide sequences that are homologous to the del-l coding 
0 sequence, respectively, or portions or derivatives thereof. 
In the second approach, the recombinant expression 
vector/host system can be identified and selected based upon 
the presence or absence of certain "marker" gene functions 
(e -^< thymidine kinase activity, resistance to antibiotics, 
5 resistance to methotrexate, transformation phenotype , 

occlusion body formation in baculovirus, etc.). For example, 
.he del-l coding sequence is inserted within a marker gene 
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sequence of the vector, recombinants containing the del - 1 
coding sequence can be identified by the absence of the 
marker gene function. Alternatively, a marker gene can be 
placed in tandem with the del -I sequence under the control of 
the same or different promoter used to control the expression 
of the del - 1 coding sequence. Expression of the marker in 
response to induction or selection indicates expression of 
the del -1 coding sequence. 

In the third approach, transcriptional activity for the 
del-1 coding region can be assessed by hybridization assays. 
For example, RNA can be isolated and analyzed by Northern 
blot using a probe homologous to the del-1 coding sequence or 
particular portions thereof. Alternatively, total nucleic 
acids of the host cell may be extracted and assayed for 
hybridization to such probes. Additionally, RT-PCR may be 
used to detect low levels of gene expression. 

In the fourth approach, the expression of the Del-1 
protein product can be assessed immunologically, for example 
by Western blots, immunoassays such as radioimmuno- 
precipitation, enzyme-linked immunoassays and the like. 
This can be achieved by using an anti-Del -1 antibody and a 
Del-l binding partner such as aV62 . Alternatively, the 
biologic activities of Del-1 can be determmined by assaying 
its ability to inhibit vascular morphogenesis of endothelial 
cells . 

5.5. USES OF DEL-1 ENGINEERED CSLL LINES 
In an embodiment of the invention, the Del-1 protein 
and/or cell lines that express Del-1 may be used to screen 
for antibodies, peptides, small molecules natural and 
synthetic compounds or other cell bound or soluble molecules 
that bind to the Del-1 protein. For example, ant i -Del - 1 
antibodies may be used to inhibit or stimulate Del-1 
function. Alternatively, screening of peptide libraries with 
recombinant ly expressed soluble Del-1 protein or cell lines 
expressing Del-1 protein may be useful for identification of 
therapeutic molecules that function by inhibiting or 



stimulating the biological activity of Del - 1 . The uses of 
the Del -1 protein and engineered cell lines, described in the 
subsections below, may be employed equally well for other 
members of the del -1 gene family in various species. 

In an embodiment of the invention, engineered cell lines 
which express most of the del -1 coding region or a portion of 
it fused to another molecule such as the immunoglobulin 
constant region (Hollenbaugh and Aruffo, 1992, Current 
Protocols in Immunology, Unit 10.19; Aruffo et al . , 1990, 
Cell 61:1303) may be utilized to produce a soluble molecule 
to screen and identify its binding partners . The soluble 
protein or fusion protein may be used to identify such a 
molecule in binding assays, affinity chromatography, 
immunoprecipitation , Western blot, and the like. 
Alternatively, portions of del -I may be fused to the coding 
sequence of the SGF receptor transmembrane and cytoplasmic 
regions. Assuming that Del-1 can function as a cell-bound 
receptor, this approach provides for the use of the EG? 
receptor signal transduction pathway as a means for detecting 
molecules that bind to Del-1 in a manner capable of 
triggering an intracellular signal. On the other hand, Del-1 
may be used as a soluble factor in binding to cell lines that 
express specific known receptors such as integrins . 
Synthetic compounds, natural products, and other sources of 
potentially biologically active materials can be screened in 
assays that are well known in the art. 

Random peptide libraries consisting of all possible 
combinations of amino acids attached to a solid phase support 
may be used to identify peptides that are able to bind to the 
ligand binding site of a given receptor or other functional 
domains of a receptor such as kinase domains (Lam, K.S. et 
al . , 1991, Nature 354: 32-34) . The screening of peptide 
libraries may have therapeutic value in the discovery of 
pharmaceutical agents that stimulate or inhibit the 
biological activity of receptors through their interactions 
with the given receptor. 



Identification of molecules that are able to bind to the 
Del-1 protein may be accomplished by screening a peptide 
library with recombinant soluble Del-1 protein. Methods for 
expression and purification of Del-1 are described in Section 
5 5.2, supra, and may be used to express recombinant full 
length del-1 or fragments of del-1 depending on the 
functional domains of interest. For example, the EGF-like 
and discoidin I/factor VIII domains of del-1 may be 
separately expressed and used to screen peptide libraries. 
10 To identify and isolate the peptide/solid phase support 

that interacts and forms a complex with Del-1, it is 
necessary to label or "tag" the Del-I molecule. The Del-1 
protein may be conjugated to enzymes such as alkaline 
phosphatase or horseradish peroxidase or to other reagents 
.5 such as fluorescent labels which may include fluorescein 
isothiocyanate (FITC) , phycoerychrin (?E) or rhodamine . 
Conjugation of any given label to Del-1 may be performed 
using techniques that are well known in the art. 
Alternatively, del-1 expression vectors may be engineered to 
0 express a chimeric Del-1 protein containing an epitope for 
which a commercially available antibody exist. The epitope 
specific antibody may be tagged using methods well known in 
the art including labeling with enzymes, fluorescent dyes or 
colored or magnetic beads. 
5 The "tagged" Del-1 conjugate is incubated with the 

random peptide library for 3 0 minutes to one hour at 22 °C to 
allow complex formation between Del-1 and peptide species 
within the library. The library is then washed to remove any 
unbound protein. If Del-1 has been conjugated to alkaline 
0 phosphatase or horseradish peroxidase the whole library is 
poured into a petri dish containing substrates for either 
alkaline phosphatase or peroxidase, for examole, 5-bromo-^- 
chloro-3 -indoyl phosphate (BCI?) or 3,3' ,4,4"- 
diaminobenzidine (DAB) , respectively. After incubating for 
5 several minutes, the peptide/solid phase-Del-1 complex 
changes color, and can be easily identified and isolated 
physically under a dissecting microscope with a 



micromanipulator- If a fluorescent tagged Del - 1 molecule has 
been used, complexes may be isolated by fluorescence 
activated sorting. If a chimeric Del- 1 protein expressing a 
heterologous epitope has been used, detection of the 
5 peptide/Del - 1 complex may be accomplished by using a labeled 
epitope specific antibody. Once isolated, the identity of 
the peptide attached to the solid phase support may be 
determined by peptide sequencing. 

In addition to using soluble Del - 1 molecules, in another 

10 embodiment, it is possible to detect peptides that bind to 

cell surface recentors using intact cells. The use of intact 
cells is preferred for use with receptors that are mult i - 
subunits or labile or with receptors that require the lipid 
domain of the cell membrane to be functional . Methods for 

15 generating cell lines expressing del -1 are described in 

Section 5.3. The cells used in this technique may be either 
live or fixed cells. The cells may be incubated with the 
random peptide library and bind to certain peptides in the 
library to form a "rosette" between the target cells and the 

2 0 relevant solid phase support /peptide . The rosette can 

thereafter be isolated by differential centrif ugation or 
removed physically under a dissecting microscope . 

As an alternative to whole cell assays for membrane 
bound receptors or receptors that require the lipid domain of 
25 the cell membrane to be functional, the receptor molecules 
can be reconstituted into liposomes where label or "tag" can 
be attached. 

Various procedures known in the art may be used for the 
production of antibodies to epitopes of the natural and 

3 0 recombinant ly produced Del - 1 protein. Such antibodies 

include but are not limited to polyclonal, monoclonal, 
chimeric, single chain, Fab fragments and fragments produced 
by an Fab expression library. Neutralizing antibodies i.e. , 
those which compete for the iigand binding site of the Del -I 
35 protein are especially preferred for diagnostics and 
therapeutics . 
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Monoclonal antibodies that bind Del - 1 may be 
radioactively labeled allowing one to follow their location 
and distribution in the body after injection. Radioisotope 
tagged antibodies may be used as a non-invasive diagnostic 
tool for imaging de novo cells of tumors and metastases. 

Immunotoxins may also be designed which target cytotoxic 
agents to specific sites in the body. For example, high 
affinity Del - 1 specific monoclonal antibodies may be 
covalently comolexed to bacterial or slant toxins, such as 
diphtheria toxin, ricin. A general method of preparation of 
antibody/hybrid molecules may involve use of thiol - 
crosslinking reagents such as S?DP , which attack the primary 
amino groups on the antibody and by disulfide exchange, 
attach the toxin to the antibody. The hybrid antibodies may 
be used to specifically eliminate Del-1 expressing tumor 
cells . 

For the production of antibodies, various host animals 
may be immunized by injection with the recombinant or 
naturally purified Del-1 protein, fusion protein or peptides, 
including but not limited to rabbits, mice, rats, etc. 
Various adjuvants may be used to increase the immunological 
response, depending on the host species, including but not 
limited to Freund's (complete and incomplete), mineral gels 
such as aluminum hydroxide, surface active substances such as 
lysolecithin, pluronic polyois, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, dinitrophenol , and 
potentially useful human adjuvants such as BCG (bacilli 
Calmette-Guerin) and Coryn eba c t er i urn parvum. 

Monoclonal antibodies to Del-1 may be prepared by using 
any technique which provides for the production of antibody 
molecules by continuous ceil lines in culture. These include 
but are not limited to the hybridoma technique originally 
described by Kohler and Milstein, (Nature, 1975, 256:495- 
497), the human B -ceil hybridoma technique (Kosbor et al . , 
1983, Immunology Today, 4:72; Cote et al . , 19S3, ?roc . Natl. 
Acad. Sci., 80:2026-203 0) and the E3V- hybridoma technique 
(Cole et al . , 1985, Monoclonal Antibodies and Cancer Therapy, 
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Alan R. Liss, Inc., pp. 77-96) . In addition, techniques 
developed for the production of "chimeric antibodies" 
(Morrison et al . , 1984, Proc. Natl . Acad. Sci . , 81:6851-6855 ; 
Neuberger et al., 1984, Nature, 312:604-608; Takeda et al . , 
5 1985, Nature, 314:452-454) by splicing the genes from a mouse 
antibody molecule of appropriate antigen specificity together 
with genes from a human antibody molecule of appropriate 
biological activity can be used. Alternatively, techniques 
described for the production of single chain antibodies (U.S. 
10 Patent 4,946,778} can be adapted to produce Del - 1-specif ic 
single chain antibodies . 

Hybridomas may be screened using enzyme-linked 
?fl immunosorbent assays (ELISA) in order to detect cultures 

j,l secreting antibodies specific for refolded recombinant Del-1 . 

i: 4 15 Cultures may also be screened by ELISA to identify those 

[" cultures secreting antibodies specific for mammalian-produced 

U& Del-1. Confirmation of antibody specificity may be obtained 

^ by western blot using the same antigens. Subsequent ELISA 

|~= testing may use recombinant Del-1 fragments to identify the 

Hi 20 specific portion of the Del-1 molecule with which a 

til 

f fi monoclonal antibody binds. Additional testing may be used to 

?fl identify monoclonal antibodies with desired functional 

characteristics such as staining of histological sections, 
immunoprecipitation of Del-1, or neutralization of Del-1 

2 5 activity. Determination of the monoclonal antibody isotype 

may be accomplished by ELISA, thus providing additional 
information concerning purification or function. 

Antibody fragments which contain specific binding sites 
of Del-l may be generated by known techniques. For example, 

3 0 such fragments include but are not limited to: the F(ab') 2 

fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab f raqment s which can be 
generated by reducing the disulfide bridges of the F(ab') 2 
fragments. Alternatively, Fab expression libraries may be 
35 constructed (Huse et al . , 1989, Science, 246:1275-1281) to 
allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity to Del-1. Ant i- Del - 1 
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antibodies may be used to isolate Del -1 -expressing cells or 
eliminate such cells from a cell mixture. 

5.6. USES OF DEL- 1 POLYNUCLEOTIDE 

5 A del -2 polynucleotide may be used for diagnostic and/or 

therapeutic purposes. For diagnostic purposes, a del-1 
polynucleotide may be used to detect del-1 gene expression or 
aberrant del -I gene expression in disease states. Included 
in the scope of the invention are oligonucleotide sequences, 
10 that include antisense RNA and DNA molecules and ribozymes, 
that function to inhibit translation of del-1. 

5.6.1. DIAGNOSTIC USES OF A DEL-1 POLYNUCLEOTIDE 

A del-1 polynucleotide may have a number of uses for the 
15 diagnosis of diseases resulting from aberrant expression of 
del-1. For example, the del-1 DNA sequence may be used in 
hybridization assays of biopsies or autopsies to diagnose 
abnormalities of del-1 expression; e.g., Southern or Northern 
analysis, including in situ hybridization assays. Such 

2 0 techniques are well known in the art, and are in fact the 

basis of many commercially available diagnostic kits. 

5 - 6 - 2 - THERAPEUTIC USES OF A DEL-1 POLYNUCLEOTIDE 

A del-1 polynucleotide may be useful in the treatment of 
25 various abnormal conditions. By introducing gene sequences 
into cells, gene therapy can be used to treat conditions in 
which the cells do not proliferate or differentiate normally 
due to underexpression of normal del-1 or expression of 
abnormal/inactive del-1. In some instances, the 

3 0 polynucleotide encoding a del -I is intended to replace or act 

m the place of a functionally deficient endoaenous a-^- 
Alternatively, abnormal conditions characterized by 
overproliferation can be treated using the gene therapy 
techniques described below. 
3 5 Abnormal cellular proliferation is an important 

component of a variety of disease states. Recombinant gene 
therapy vectors, such as viral vectors, may be engineered to 



express variant, signalling incompetent forms of Del - 1 which 
may be used to inhibit the activity of the naturally 
occurring endogenous Del - 1 . A signalling incompetent form 
may be, for example, a truncated form of the protein that is 
5 lacking all or part of its signal transduction domain. Such 
a truncated form may participate in normal binding to a 
substrate but lack signal transduction activity. Thus 
recombinant gene therapy vectors may be used therapeutically 
for treatment of diseases resulting from aberrant expression 

10 or activity of an Del-1. Accordingly, the invention provides 
a method of inhibiting the effects of signal transduction by 
an endogenous Del-1 protein in a cell comprising delivering a 
DNA molecule encoding a signalling incompetent form of the 
Del-1 protein to the cell so that the signalling incompetent 

15 Del-1 protein is produced in the cell and competes with the 
endogenous Del-1 protein for access to molecules in the Del-1 
protein signalling pathway which activate or are activated by 
the endogenous Del-1 protein. 

Expression vectors derived from viruses such as 

20 retroviruses, vaccinia virus, adeno-associated virus, herpes 
viruses, or bovine papilloma virus, may be used for delivery 
of recombinant Del-1 into the targeted cell population . 
Methods which are well known to those skilled in the art can 
be used to construct recombinant viral vectors containing an 

25 del -1 polynucleotide sequence. See, for example, the 

techniques described in Maniatis ec al . , 1989, Molecular 
Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, 
N.Y. and Ausubel et al . , 198 9, Current Protocols in Molecular 
Biology, Greene Publishing Associates and Wiley Interscience , 

3 0 N.Y. Alternatively, recombinant Del-1 molecules can be 

reconstituted into liposomes for delivery to target cells . 

Oligonucleotide sequences, that include anti-sense RNA 
and DNA molecules and ribozymes that function to inhibit the 
translation of a del-1 mRNA are within the scope of the 

35 invention. Ant i- sense RNA and DNA molecules act to directly 
block the translation of mRNA by binding to targeted mRNA and 
preventing protein translation. In regard to antisense DNA, 
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ol igodeoxyribonucleot ides derived from the translation 
initiation site, e.g., between -10 and +10 regions of a del -1 
nucleotide sequence, are preferred. 

Ribozymes are enzymatic RNA molecules capable of 
catalyzing the specific cleavage of RNA. The mechanism of 
ribozyme action involves sequence specific hybridization of 
the ribozyme molecule to complementary target RNA, followed 
by endonucleolycic cleavage. Within the scope of the 
invention are engineered hammerhead motif ribozyme molecules 
that specifically and efficiently catalyze endonucleolytic 
cleavage of del -2 RNA seouences . 

Specific ribozyme cleavage sites within any potential 
RNA target are initially identified by scanning the target 
molecule for ribozyme cleavage sites which include the 
following sequences, GUA, GUU and GUC . Once identified, 
short RNA sequences of between 15 and 2 0 ribonucleotides 
corresponding to the region of the target gene containing the 
cleavage site may be evaluated for predicted structural 
features such as secondary structure that may render the 
oligonucleotide sequence unsuitable. The suitability of 
candidate targets may also be evaluated by testing their 
accessibility to hybridization with complementary 
oligonucleotides, using ribonuclease protection assays. 

Both anti- sense RNA and DNA molecules and ribozymes of 
the invention may be prepared by any method known in the art 
for the synthesis of RNA molecules. These include techniques 
for chemically synthesizing oligodeoxyribonucleotides well 
known in the art such as for example solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by in vitro and in vivo 
transcription of DNA sequences encoding the antisense RNA 
molecule . Such DNA seauences may be incorporated into a wide 

-A ^ 

variety of vectors which incorporate suitable RNA polymerase 
promoters such as the T7 or SPG polymerase promoters . 
Alternatively, antisense cDNA constructs that synthesize 
antisense RNA cons ti tut ively or indue ibly, depending on the 
promoter used, can be introduced stably into cell lines. 



Various modifications zo the DNA molecules may be 
introduced as a means of increasing intracellular stability 
and half -life. Possible modifications include but are not 
limited to the addition of flanking sequences of ribo- or 
5 deoxy- nucleotides to the 5' and/or 3' ends of the molecule 
or the use of phosphorothioace or 2' 0 -methyl rather than 
phosphodiesterase linkages within the oligodeoxyribonucleo- 
tide backbone. 

Methods for introducing polynucleotides into such cells 
10 or tissue include methods for in vitro introduction of 
polynucleotides such as the insertion of naked 
polynucleotide, i.e., by injection into tissue, the 
introduction of a del-1 polynucleotide in a cell ex vivo, 
i.e., for use in autologous cell therapy, the use of a vector 
15 such as a virus, retrovirus, phage or plasmid, etc. or 

techniques such as electroporation which may be used in vivo 
or ex vivo. 

5.7. USES OF DEL-1 PRQT5JN 
20 Analysis of /3-gal expression in transgenic mice in which 

/J -gal gene expression is controlled by the del-1 enhancer 
indicates that the del-1 gene is activated in endothelial 
cells undergoing vasculogenesis . Vasculogenesis refers no 
the development of blood vessels de novo from embryonic 
25 precursor cells. The related process of angiogenesis is the 
process through which existing blood vessels arise by 
outgrowth from preexisting ones. Vasculogenesis is limited 
to the embryo while angiogenesis continues throughout life as 
a wound healing response or to increase oxygenation of 
30 chronically stressed tissues (Pardanaud et al . , 1989 

Development 105:473; Granger 1994, Cell and Mol . Biol. Res. 
40 : 31) . 



is likely that Del -I functions during embryonic 
vasculogenesis and in angiogenesis. For therapeutic use, it 
35 is essential chat Del-1, portions of Del-1 or antibodies that 
block Del-l, may interact with angiogenic cells since it is 
stimulation or inhibition of these cells that is clinically 



relevant. Manipulation of Del - I function may have 
significant effects on angiogenesis if Del - 1 normally 
participates in this process. 

The working examples in Sections 9 and 10 demonstrate 
5 that Del-1 exhibits an inhibitory effect on angiogenesis, 

which may be mediated by its interaction with aV(33 -expressing 
endothelial cells. Del-1 protein or recombinant proteins 
consisting of portions of Del-1 may function to suppress 
angiogenesis or induce endothelial cell apoptosis. This 

10 function could be clinically useful to prevent 

neovascularization of tissues such as tumor nodules . It has 
been demonstrated that inhibition of angiogenesis is useful 
in preventing tumor metastases (Fidler and Ellis, 1994, Cell 
79:185). Recently, O'Reilly ei al (1994, Cell 79:315) 

15 reported that a novel angiogenesis inhibitor isolated from 
tumor-bearing mice, angiostatin, specifically inhibited 
endothelial cell proliferation. Jn vivo, angiostatin was a 
potent inhibitor of neovascularization and growth of tumor 
metastases. In a related report, Brooks et al (1994, Cell 

20 79:115) showed that integrin antagonises promoted tumor 

regression by inducing apoptosis of angiogenic blood vessels. 
These integrin antagonists included cyclic peptides 
containing an RGD amino acid sequence. Since Del-1 contains 
an RGD sequence, the use of this portion of the Del-1 

25 molecule may have similar effects. 

Manipulation of the discoidin I/factor VHI-like domains 
of Del-1 may also be used to inhibit angiogenesis. 
Apolipoprotein E (ApoE) has been shown to inhibit basic 
fibroblast growth factor (bFGF) -stimulated proliferation of 

3 0 endothelial cells in vitro (Vogel et al . , 1994, J. Cell. 
Biochem. 54:299) . This effect could also be produced with 
synthetic peptides based on a portion of the AooE seauence . 
These results could be due to direct: competition of ApoE with 
growth factors for binding to heparin sulfate proteoglycans, 

35 or through disruption by ApoE of cell-matrix interactions. 
It has been proposed that discoidin I /factor VII I -like 
domains such as those in Del-1 bind to proteoglycans. In 
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addition, Del - 1 is similar in structure to a number of 
extracellular matrix proteins. Thus, Del-i may be 
manipulated to effect the activity of growch factors such as 
bFGF or to alter interactions between endothelial cells and 
5 the extracellular matrix. 

The anti-angiogenic activity of Del-1 may be used to 
treat abnormal conditions that result from angiogenesis . 
These conditions include, but are not limited to, cancer, 
diabetic retinopathy, rheumatoid arthritis and endometriosis. 
10 Additionally, the removal or inhibition of Del-1 in 

situations where it naturally inhibits blood vessel formation 
may be used to promote angiogenesis. These conditions 

: 1 

#fl inlcude, but are not limited to, cardiac ischemia, thrombotic 

ss z 
* * : 

\ m * stroke, would healing and peripheral vascular disease. 

*'4 ' 15 Furthermore, Del-1 may be used to stimulate bone formation. 

SfjjJ 

6. EXAMPLE: MOLECULAR CLONING OF HUMAN AND MURINE DEL-1 

NUCLEOTIDE SEQUENCES 

6.1. MATERIALS AND METHODS 

6-1,1. GENERATION OF TRANSGENIC MICE 

The SLM275 transgenic mouse line was generated in a 
C57BL6xDBA/Fl background, and the transgenic animals had been 
crossed back against similar 36D2F1 animals for maintenance 
of the line and the generation of embryos. This transaene 
had been maintained in the heterozygous state, and these 
heterozygous mice had normal breeding capacity. However, 
preliminary experiments indicated that these animals were not 
viable in the homozygous state. 

6,1.2. MOLECULAR CLONING OF DEL-1 
A genomic library was constructed from high 
molecular weight DNA isolated from the kidney of a SLM275 
transgenic animal. This DNA was subjected .to partial 
digestion with Sau3A to obtain an average size of 20 kb, 
subjected to a partial fill-in reaction, and then cloned into 
a similarly treated iambdaphage vector (lambdaFix, 
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Stratagene) . The library construcced in this fashion had a 
base of approximately 2 million clones. These clones were 
amplified and the library stored at -70°C. A 200 baseoair 
(bp) probe derived from the SV4 0 polyadenylation signal of 
5 the transgene was used as a probe and allowed the isolation 
of 12 lambdaphage clones. Six of these clones were randomly 
chosen for further investigation. These clones were mapped, 
and restriction fragments which did not contain transgene 
sequence identified. The clones were divided into two groups 
10 on the basis of common non-transgenic fragments. One such 
fragment from the first group of phage allowed specific 
hybridization to genomic blots and provided evidence that: in 
was derived from a region adjacent to the integration site. 
Genomic DNA from a non-transgenic mouse of the same generic 
15 background (B6D2F1) was compared to that of a SLM2 75 

transgene animal by hybridization to this probe. Rearranged 
bands representing fragments disrupted by transgene 
integration were seen in the SLM275 lanes with both EcoRl and 
BamHl digests. The flanking sequence probe was employed to 

2 0 screen a commercially available lambdaFixII genomic library 

constructed from the 129SV mouse strain (Stratagene) . 

A murine cDNA fragment was used as a probe to identify 
cDNA clones of its human homolog. The probe corresponded to 
nucleotides 1249 through 1566 in the murine del-1 major 
25 sequence. Human cDNA clones were isolated from a human fetal 
lung cDNA library (Clonetech, Inc.) following standard 
procedures . 

6.2. RESULTS 

3 0 A transgenic mouse line was created through a fortuitous 

enhancer trap event. The original studies were designed to 
map the cell - specif ic and developmental -specif ic regulatory 
regions of the mouse SPARC promoter, 2.2 kilobases (kb) of 
the SPARC 5' flanking sequence were placed upstream of the E. 
35 coli la.cZ (beta-galactosidase or j3-gal) reporter gene. The 
mouse SPARC gene is normally expressed in a wide variety of 
adult and embryonic cells which synthesize a specific 



extracellular matrix (Nomura et al . , 1989, J. Biol. Cham. 
264:12201-12207) . However, one of the founder mouse lines 
showed a highly restricted pattern of expression quite 
distinct: from the native SPARC gene. Expression of the la.cZ 
5 reporter in this particular line of mice referred to as 
SLM275 was seen very early in cells of the endothelial 
lineage. Whole mount lacZ staining was employed for initial 
studies, and these embryos were subsequently sectioned and 
examined by light microscopy. The first cells to stain were 
0 endothelial cells forming the endocardium, the outflow tract, 
and the developing intervertebral vessels. Staining appeared 
to be predominantly restricted to endothelial cells 
associated with forming major blood vessels. Expression 
began to decline after 11.5 days pc . 
5 The genomic region targeted by this transgene is herein 

referred to as del -1 . initial cloning experiments were aimed 
at isolating genomic sequences flanking the transgene 
integration site. A number of lambdaphage clones were 
isolated and mapped (Figure 1) . Approximately 4 0 kb of the 
) wild-type del -2 sequence was contained in these donas. By 
probing Southern blots containing restriction digests of 
these lambdaphages with non- transgenic fragments from the 
SLM275 lambdaphage clones, the site of transgene integration 
was mapped. insertion of the transgene complex was 
> associated with the deletion of approximately 8 kb of DNA . 
There were approximately 2 5 kb of flanking sequence on one 
side of the integration, and approximately 5 kb of the other 
flanking saquanca contained on these clones. 

Exon trapping was used to evaluate genomic fragments for 
the presence of exons . This approach utilized a vector with 
a constitutive promoter driving transcription through a DNA 
fragment containing a splice donor site and a splice acceptor 
site. Between these splicing signals was a common cloning 
site where the genomic DNA fragment to be evaluated was 
cloned. Exons within this fragment would be spliced into the 
transcript when the construct was transfected into eukaryotic 
cells, such as COS ceils. The transcript containing the 



trapped exon sequence was rescued from zhe COS cells by 
reverse transcripcase polymerase chain reaction (RT-PCR) . 
PCR amplified DNA was cloned and evaluated. 

A 160 bp exon was trapped from a fragment of genomic DNA 
located approximately 10 kb from che "left" integration site. 
Nucleotide sequence of the trapped exon was employed to 
screen various nucleic acid databanks through the BLAST 
routine at the NC3I, revealing no other gene with significant 
nucleic acid homology. The deduced amino acid sequence of 
the single open reading frame was subsequently employed in 
databank searches . These revealed that che protein domain 
encoded in the trapped exon was similar in part to domains in 
a number of proteins, including Factor V, Factor VIII and 
discoidin I (Figure 2) {Jenny en al . , 1987, Proc . Natl. Acad. 
Sci. U.S.A. 84:4846-4850; Poole et al . , 1981, J. Mol . Biol. 
153:273-289; Toole et al . , 1984, Nature 312:342-347). The 
protein which was most similar was milkfac globule protein, 
which had been found on the surface of mammary epithelial 
cells (1994, WO 94/11508). It has been hypothesized that the 
discoidin I - like domain in this protein allows it to localize 
to the surface of nhe epithelial cell (Larocca et al . , 1991, 
Cancer Res. 51:4994-4998; Stubbs et al . , 1990, Proc . Natl. 
Acad. Sci. U.S.A. 37:8417-8421). The homologous regions of 
Factor V and Factor VIII have been implicated in their 
interaction with phospholipids on the surface of endothelial 
cells and platelets (Jenny et al . , 1987, Proc. Natl. Acad. 
Sci. U.S.A. 84:4845-4850; Toole et al . , 1984, Nature 312:342- 
347) . Homology to the Xenopus protein A5 was also observed. 
A5 is a neuronal cell surface molecule which is expressed in 
retinal neurons and the neurons in the visual center with 
which che retinal neurons contac: (Takagi et al . , 1991, 
Neuron 7:295-3 07) . A5 has been proposed co play a role as a 
neuronal recognition molecule in the development of nhis 
neural circuit, perhaps through mediating intercellular 
signaling. The protein for which this discoidin I -like 
domain was named is a protein expressed in Dictyostelium 



discoideum, which serves an essential role in che aggregation 
of individual cells. 

The DNA fragment encoding the trapped exon was employed 
as a probe in a Southern blot experiment and shown to 
5 hybridize with regions of the del -2 locus outside of the 
region that was employed in the exon trap construct. Given 
this finding, cDNA cloning was pursued by using the exon trap 
probe to screen an 11.5 day embryonic mouse cDNA library. 
Clones were plaque purified, and inserts subcloned into 
10 plasmid for further analysis. Nucleotide sequence analysis 
showed that two of the embryonic cDNA clones contained the 
sequence of the trapped exon. Sequence from the clones was 
used to expand the deduced amino acid sequence of the 
discoidin I-like domain (Figure 2) . The full nucleotide 
15 sequence of these cDNAs was analyzed and cloned into plasmid 
vectors which allowed the generation of cRNA transcripts for 
RNAse protection and in situ hybridization (Figure 3A-3E) . 

A human cDNA was isolated from a human fetal lung cDNA 
lambdaphage library purchased from Clontech Inc. (Figure 4A- 
20 4C) . A portion of the mouse del -I cDNA was used as a probe 
(Figure 5) . The identity of the human cDNA clone was 
confirmed by comparing the human and mouse DNA sequences . 
These clones show approximately 8 0% DNA sequence homology and 
approximately 94% amino acid sequence homology (Figure 6) 
25 These sequences are referred to as the "major" form of del -2. 
Upon initial isolation of del-1, standard molecular biology 
methods were used for isolating additional clones. 

DNA sequence analysis of the human del-1 revealed an 
open reading frame of 1,446 base pairs predicted to encode a 
3 0 481 amino acid protein with a molecular weight of 53,797. 
The mouse cDNA encodes a 480 amino acid protein. Homology 
comparisons with DNA and protein databases indicated that the 
Del-1 protein was composed of three EGF-like protein domains, 
followed by two discoidin I/factor VHI-like domains (Figure 
35 7) . Genes similar to del -I included some key regulators of 
cell determination and differentiation such as Notch. 
Overall, the Del-1 protein has a structure similar to the 



membrane-associated milk fat globule membrane protein, 
MGF-E8, which has been used to develop antibodies for imaging 
breast cancer (Figure 8) . 

A physiologic function for the Del-1 protein is 
5 implicated by the activities which have been demonstrated for 
EGF-like and discoidin I/factor VIII -like domains in other 
proteins. EGF-like domains have been shown to participate in 
protein-protein binding interactions, while the discoidin I- 
like domains of factor VIII are believed to mediate binding 

LO to cell membranes through association with negatively charged 
phospholipids. Thus, the Del-1 protein may generate a signal 
for endothelial cell determination or differentiation by 
binding to the membranes of precursor cells and interacting 
with an EGF-like domain receptor orotein. 

-5 Key structural features of the open reading frame of 

human Del-1 include: 

1) the presumed initiator methionine and 

putative secretion signal sequence 
(Figure 9) 

0 2) the three EGF-like domains (Figure 10) 

3) the two discoidin I -like domains. 

Further cloning and analysis of both the human and 
murine del-1 genes revealed additional variant f orms . For 
example, a human splicing variant (22 0 clone) was obtained in 

5 which 3 0 bp (i.e. 10 amino acids) between the first and 

second EGF-like domains of the major form of del-1 had been 
removed (Figure 11) . m addition, a truncated version of 
murine del-1 was isolated, which contained a signal peptide 
sequence, all three EGF-like domains and only a partial 

0 amino-terminal discoidin I/factor VHI-like domain (about 
40%) . This variant is referred to as murine del-1 minor 
sequence, which is disclosed in Figure 12A-12E. This 
transcript was cloned only from mouse embryonic libraries, 
but was verified through cloning of several" independent 

5 cDNAs . 
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7 - EXAMPLE: TISSUE DISTRIBUTION OF DEL - 1 GENS EXPRESSION 
7.1. MATERIALS AND METHODS 

7.1.1. WHOLE MOUNT STAINING OF TRANSGENIC MOUSE 
EMBRYOS 



5 Male transgenic animals of second or third generation 

had been crossed with 8-10 week B6D2F1 females, and embryos 
harvested at 7.5, 8.5, 9.5, 10.5, and 13.5 days. Timing was 
based on the convention that noon of the day of plugging was 
0.5 day post-coitum (pc) . Embryos were harvested, dissected 

10 free of decidua and membranes, fixed in 2% glutaraldehyde , 
and stained as a whole mount in a standard X-gal indicator 
solution according to standard protocols. An exception was 
that embryos older than 11.5 days were bisected which allowed 
better penetration of the fixative and staining solution. 

15 Stained tissues were identified in whole mount embryos by 

examination at 7-70x with an Olympus SZH10 stereomicroscope, 
and photographed under darkfield illumination. Embryos 7.5, 
8.5, 9.5, and 13.5 days pc were embedded in paraffin, 
sectioned, counterstained with nuclear fast red and examined 

2Q under bright field with a Zeiss Axioplan microscope. 

7,1.2. NORTHERN BLOT ANALYSIS 

In order to study the expression of the del -1 gene, 
Northern blots containing RNA obtained from a variety of 

25 human and mouse tissues (Clontech, Palo Alto, CA) were 
hybridized with a radiolabeled DNA probe as shown in 
Figure 5. In addition, adult organs, 15.5 dpc whole embryos 
and organs dissected from embryos were disrupted with a 
polytron, and RNA isolated over C S C1 gradient (Sambrook et 

3Q al., 1989, iMolecuIar Cloning, A Laboratory Manual, Cold 
Spring Harbor Laboratory, NY) . Briefly, the blots were 
prehybridized at 42°C for 3-S hours in a solution containing 
5X SSPE, 10X Denhardt's solution, 100 fig /ml freshly 
denatured, sheared salmon sperm DNA, 50% formamide (freshly 

35 deionized) , and 2% SDS . The radiolabeled probe was heat 

denatured and added to the prehybridization mix and allowed 
to hybridize at 42 *C for 18-24 hours with constant shaking. 
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The blots were rinsed in 2X SSC, 0.05% SDS several times at 
room temperature before being transferred to a wash solution 
containing 0 . IX SSC, 0.1% SDS and agitated at 50°C for 40 
minutes. The blots were chen covered with plastic wrap, 
mounted on Whatman paper and exposed to x-ray film at -70°C 
using an intensifying screen. 

7.1.3 REVERSE TRANSCRIPTION -POLYMERASE 
CHAIN REACTION (RT-PCR) 

Total RNA was isolated using standard laboratory 

procedures (Sambrook et al . , 1989, Molecular Cloning, A 

Laboratory Manual, Cold Spring Harbor Laboratory, NY) . 

Approximately 1 jig of total RNA was reverse transcribed and 

the cDNA was amplified by ?CR (Perkin Elmer, Norwalk, CT) . 

The PCR amplification conditions were: 94°C for 30 sec, 60°C 

for 30 sec, 72°C for 30 sec for a total of 40 cycles. The 

amplified products were resolved by agarose gel 

electrophoresis and visualized by ethidium bromide staining. 

The amplimers were: 

-f- strand primer: ACC CAA GGG GCA AAA AGG A 

- strand primer: CCT GTA ACC ATT GTG ACT G 

7.2. RESULTS 

Expression of del - 1 in various human and mouse tissues 
and cell lines was investigated by whole mount staining, 
Northern blot analysis and RT-PCR. Results of experiments 
are summarized in the subsections below. 

7.2.1 EXPRESSION ANALYSIS BY HISTOCHEMISTRY 

When the earliest time point was investigated by whole 
mount and histochemical staining in transgenic mice at day 
7.5 dc, expression of the lacZ reporter crene was shown in 
cells forming the exura embryonic mesoderm (Figure 13A) . 
These cells would form the yolk sac and give rise to cells o 
the blood island. Expression of the la.cZ reporter gene in 
this locus is thus one of uhe earliest: known markers of the 
endothelial cell lineage. The only other marker which has 



been shown to be expressed in precursors of endothelial cells 
at this early stage of development is the receptor tyrosine 
kinase flk-1 (Millauer et al . , 1993, Call 72:835-846). 
However, del -I expression was not found in the allantois, as 
5 with other early markers of the endothelium such as flk-1 
(Yamaguchi et al . , 1993, Development 118:489-498). 

At day 8.5, lacZ staining was seen in cells in the blood 
islands of the yolk sac. Interestingly, staining was not 
detected in mature endothelial cells lining the blood island, 
10 but rather in round cells found in clumps within the blood 
island (Figure 13B) . These round cells had large nuclei and 
were closer in appearance to hematopoietic precursors rather 
than endothelial cells. This expression pattern was distinct 
from all other early endothelial markers. Thus, the del -2 
15 locus might be expressed in early embryonic cells which were 
precursors to both endothelial and hematopoietic lineages. 
In the late primitive streak stage embryo at 8.5 days pc, 
there was also staining of endothelial cells associated with 
the developing paired dorsal aortae . LacZ staining was seen 
2 0 in cells in the region of the forming heart at this staae, 
and these were presumably endothelial cells that would form 
the endocardium. By day 9.5 (10-14 somites), the endocardium 
and endothelial cells forming the outflow tract and aorta 
showed la.cZ staining (Figure 13C, 13D) . This staining 
25 persisted until day 10.5 and 11.5, and by whole mount 

analysis endothelial cells associated with all large vascular 
structures were expressing the reporter gene . 

LacZ staining of embryos at day 13.5 of development was 
evaluated in the whole mount, and in sections made from 
30 paraffin embedded embryos. 3y this time, there was only 

patchy staining of endothelial cells in large vessels such as 
the aorta, whereas smaller vessels had virtually no staining 
(Figure 13S) . The only blood vessels which showed prominent 
lacZ staining ac this stage were the pulmonary capillaries. 
35 The developing pulmonary vascular network stained intensely, 
making the entire lung appear grossly blue -green (Figure 
13E) . Identification of the stained cells was made by 



microscopy of stained sections (Figure 13F) . Also, 
visualization of X-gal stained cells forming vascular 
channels was possible by viewing thick sections with Nomarski 
.differential interference contrast optics. Organ vasculature 
5 associated the liver, brain and kidney showed no staining. 
In the heart, there was some residual staining of endothelial 
cells of the atrium. The majority of endothelial cells 
lining the ventricle no longer stained. The striking finding 
in the ventricle was that the cells forming the papillary 

10 muscle and the mitral valve showed marked staining. This 
labeling was seen not only in the endothelial cells on the 
surface, but in cells forming these structures. In a similar 
fashion, cells in the area of the forming valves of the aorta 
and pulmonary showed lacZ activity. Again, cells in the 

15 forming valve and in the wall of the vessel were stained 

(Figure 13G and 13H) . The only non-cardiovascular staining 
was observed in cells in the areas of active bone formation. 
In particular, staining was most prominent in the proximal 
portions of the ribs, vertebrae, and the limb girdles (Figure 

20 13E) . After 13.5 days, the only cells expressing the lacZ 
gene were pulmonary endothelial cells. After approximately 
15.5 days of development, expression of the reporter 
transgene diminished and was completely negative by the time 
of birth. 

25 The aforementioned observations indicate that the 

protein encoded by the transcription unit in the del-1 locus 
is involved in early developmental processes in the 
cardiovascular system. This gene is not only a lineage 
marker, since it is expressed in restricted groups of 

3 0 endothelial cells in a temporally regulated fashion. The 
restricted expression seen at later stages indicates a 
connection with the origin of these endothelial cells, the 
mechanism of blood vessel formation, or the context-derived 
phenotype of these cells. Cells of the primordial 

35 endocardium express this marker, indicating a role in 

carciogenesis. Most striking is the pattern of expression in 
the developing valvular apparatus of the heart. Competent 



endothelial cells in the forming septum and valves have been 
shown to undergo an epithelial -mesenchymal transformation. 
This transformation appears to be due, at least in part, to 
an inductive signal, such as transforming growth factor 
5 beta 3, which is released by the myocardium (Potts et al . , 
1991, Proc. Natl, Acad. Sci . U.S.A. 8 8:1516-1520; Sinning et 
al., 1992, Anat. Rec. 232:285-292). Reporter gene expression 
in the SLM275 mouse marked the competent cells of the 
endocardium which would respond to this signal, and 
0 expression appeared to persist for some time after the 

transformation (Figure 13G and 13H) . This pattern of gene 
expression is unlike that described for any known molecule. 
Although the early endothelial expression pattern is similar 
to that characterized for the tyrosine kinases zek and flk-1 
(Dumont et al . , 1992, Oncogene 7:1471-1480; Millauer et al . , 
1993, Cell 72:835-846), there are striking differences at 
later stages which clearly indicate that lacZ expression in 
the transgenic animals marks a novel aene . 

7-2-2. EXPRESSION ANALYSIS BY NORTHERN BLOT 
Expression of del-1 in various fetal and adult tissues 
was examined by Northern blot analysis (Tables 1 and 2) . A 
portion of the mouse cDNA clone (0.3 kb Sac I probe) was used 
as a probe on six poly A RNA filters purchased from Clontech 
Inc. Human fetal tissues which were undergoing 
vasculogenesis were positive (Table 2) . An organ blot 
generated with RNA from a 15.5 day mouse embryo indicated 
expression in highly vascular organs such as kidney, lung, 
nervous system and head. Also, the time course of expression 
in whole mouse embryos was consistent with the 0-gal staining 
results observed in transgenic mice (Table 3) In general, 
adult mouse tissues were negative, or only weakly positive, 
(Table 4) . Mouse cDNA clones isolated from a brain cDNA 
library appeared to be identical to the embryonic del -I. Two 
human cancer cell lines tested were weakly positive 
(Table 5) . The results of Northern blot analysis were 



basically consistent with the pactern for a gene which 
specifically active during endothelial cell developmen 
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Table 1 
Human Adult 
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heart: 
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brain 




placenta 
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lung 
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liver 
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spleen 


— 


thymus 


_ 


prostate 
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— 


testis 




ovary 
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skeletal muscle 




kidney 




pancreas 




small intestine | 


-1- 


colon 




peripheral blood leukocyce 
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Table 2 
Human Fetal 


brain 


* * m i 

k * ■ 


lung 




liver 


-J- 


kidney 


-r -r 


(Pooled from 17-25 wks) 
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Table 3 
Mouse Embryo 

7 -day 

11- day 

15 -day 

17-day 

Table 4 
Mouse Adult 

heart 

brain 

spleen 

lun g 

live r 

skeletal muscle 

kidney 

Table 5 
Human Cancer Cell 

Promyelocytic leukemia HL6 0 
HeLa cell S3 

chronic myelogenous leukemia 
K-562 

1 ymphobla scic leukemia MOLT4 

Burkit ' s lymphoma R aj i 

colorectal adenocarcinoma 
SW480 

lung carcinoma A54 9 

melanoma G3 61 



7-2.3. EXPRESSION ANALYSIS BY RT-PCR 

RNA from mouse yolk sac (day 8 through day 12) and mouse 
fetal liver (day 13 through day 13) were nested for del -1 
expression by RT-PCR. All nested samples were positive, 

5 consistent with the Northern bloc analysis and results from 
0-gal staining in transgenic mice (Table 6) . Several mouse 
yolk sac-derived cell lines were also tested by RT-PCR for 
expression of del -I. For comparison, several other cell 
lines and total d!5 mouse fetal liver RNA samples were 

0 tested. All samples shown in Table 7 except ECV3 04 (a human 
endothelial cell line) were of mouse origin. The yolk sac- 
derived cell lines grown in long-ierm culture were not 
expressing del -2 at a detectable level. These cell cultures 
were not forming endothelial cell-like structures under these 
conditions. In contrast, an endothelial tumor line, EOMA, 
expressed high levels of del -1 . 



Table 6 
Yolk Sac and Fetal L ive r 



Sample 


Result 


d8 Yolk Sac 


» 


d9 Yolk Sac 


-!_ 


dlO Yolk Sac 




dll Yolk Sac 
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d!2 Yolk Sac 


+ 


d!3 Fetal Liver 


-4- 

1 


dl4 Fetal Liver 


t 


dl 5 Fetal Liver 


* 

* 


dlS Fetal Liver 


I 


dl 7 Fetal Liver 


T 


dlS Fetal Liver 


* | 
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Table 7 
Cell Lines 
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del -1 
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Prol3 5 




Prol75 




D-l 




A10 




ROSA02 




dl5?L 


-r -f 


SOMA. 


+ + -r 


ECV3 04 (human) 
— — . 





20 
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A number of human tumors implanted in nude mice and 
cultured in vitro were shown to express del ~1 by RT-PCR. For 
example, Table 8 shows the expression of del -1 in human 
osteosarcoma cell line 14 3B in vivo and in vitro. SOMA was 
used as a positive control. CD34 , flk-1 and tie-2 are known 
markers for endothelial cells. When human and mouse del-1 
specific PGR primers were used, both human (tumor) and murine 
(host) del-1 expression was detected. In addition, a variety 
of human tumor cell lines expressed del-1 in culture 
(Table 9) . These results indicate that Del-1 may be used as 
a tumor marker in certain cancers diagnoscically and 
therapeutically. In addition, host expression of del-1 is 
also up-regulated, possibly due no angiogenesis in tumor 
sites . 
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Table 8 



Human osteosarcoma 143B 



Sample 


Actin 


del -2 CD34 


flk-1 


■ 

tie-2 


control nude mouse 
skin 




nd 


nd 


nd 


7 day tumor 




4- n ri 




I Id 


iu Qay tumor 


* 




+ 


i 


14 day tumor 




+ -r 


+ 


+ 


cultured 143B cells 


■ 

T 


-r 






EOMA 


1 

T 


+ -r 


+ 


+ 


nd = not: determined 


Tahl 


e 9 






Human 


tumor 


cell lines 







Cell Type 


Sample 


27 cycles 


33 cycles 


Normal 


Myoblast 




~i- ~i- -i- 

1*4 




HYSE-E 




» * « 

t* -r -r 




HYS-VS1 


~ -t. 

* > 


-4- 4- — -i- 


Leukemia 


K5 6 2 








HEL 




+ 




Mo7e 






Glioblastoma 


U-118 MG 


i 


-i- -t- -I. 

1 * ■ 




U-8 7 MG 


* ■ 


+ TT 


CNS Tumor 


SF295 




r « 




U251 


* • 


[ t » » 
► i « • 




SN375 


■ " 






SNB19 




-f + -T 




SF539 






Osteosarcoma 


1433 




+ + -*- + 


Breast Carcinoma 


DU4 4 7 5 








MCF-7 


+/- 






MDA2 3 I 




* ' 


Endothelial 


SCV3 04 








HUV3C 


i 


+ + - 



8 - EXAMPLE: IMKUNQREACTIVITY OF D3L-1 GENS PRODUCT 
8.1. KATE RIALS AND METHODS 

8.1.1. ANTIBODY PRODUCTION 

A partial del -I cDNA encoding amino acids 353 to 439 of 
5- che murine gene was cloned into piMALC2 (New England Biolabs) 
to generate a maltose binding protein-partial Del-1 fusion 
protein. The del-1 sequence included in this construct 
encodes a portion of the carboxyl terminal discoidin-like 
domain. Recombinant fusion protein was expressed and 

.0 purified over an amylose affinity matrix according to the 
manufacturer's recommendations. Protein was emulsified into 
Freund's complete adjuvant, and injected as multiple 
subcutaneous injections into two New Zealand White rabbits. 
Boosting and harvesting of immune serum was performed 

.5 according to established methodology (Harlow and Lane, 1988, 
Antibody: A Laboratory Manual, Cold Spring Harbor 
Laboratory) . Immune serum obtained after the second boost 
was subjected to affinity purification. First, the antiserum 
was precleared over a Sepharose column coupled to total 

0 bacterial lysate . Subsequently, the antiserum was purified 
over an affinity column made from recombinant: fusion protein 
coupled to Sepharose. The specificity of the antiserum was 
evaluated first with western blots containing proteins from 
bacteria expressing the recombinant fusion protein before and 

5 after cleavage with factor Xa, or the maltose binding protein 
alone. Whole bacterial lysates from cells induced with IPTG 
were run on polyacrylamide gels, transferred to 
nitrocellulose, and probed with the affinity-purified 
antiserum. While crude antiserum labeled bands corresponding 

0 to maltose binding protein and the Del-1 portion of the 
fusion procein, affinity-purified antiserum specifically 
labeled the Del-1 component of the fusion protein. 

8.1.2. WESTERN BLOT 
5 For western blots of eukaryotic proteins, cells were 

harvested by lysis in a standard lysis buffer or Laemmli 
loading buffer. Cell culture supernatant was collected and 



concentrated by cencrif ugation in a centricon filter, and 
extracellular matrix harvested by first removing cells with 
1 mM EDTA in PBS , and then scraping the cell culture dish 
with a small volume of Laemmli buffer an 9 0 °C. 

5 

8,1.3. IMMUNOHISTOCHEMISTRY 
Immunohistochemistry was performed on sections prepared 
from Bouin's fixed, paraffin-embedded, staged mouse embryos 
according to well established methodology (Hogan et al . , 

0 1994, Manipulating the Mouse Embryo, Cold Spring Harbor 

Press; Quertermous et al . , 1994, Proc. Natl. Acad. Sci . USA 
91:7056). The affinity-purified Del-1 antiserum was employed 
at a dilution of 1:500 to 1:1000, and the specificity of 
staining verified by competition with recombinant protein. 

5 Staining of cartilage was amplified by pre-treating the 
section with dilute trypsin solution. 

8-1-4. TRANSFECTJQN OF YOLK SAC CELLS 

A eukaryotic expression vector was constructed by 
0 cloning the entire open reading frame of the major del-1 
transcript into phbAPr-3-neo (Gunning et al . , 1987, Proc. 
Natl. Acad. Sci. USA 84:4831). This construct was 
transfected into yolk sac cells with Lipof ectamine (Gibco 
BRL) , and clones selected in the presence of 1000 fig /ml of 
5 G418. Clones were evaluated for del-1 expression by northern 
and western blotting, and a group of clones with varying 
amounts of Del-1 protein were selected for further study. To 
serve as negative controls, a group of clones were randomly 
selected from a transf ect ion with the empty phbAPr-3-neo 
0 vector. 

8.2. RESULTS 

The major murine del-1 coding sequence was inserted into 
a eukaryotic expression vector and transfected into Del- 
5 1-non-expressing yolk sac ceils (Wei et al . , 1995, Stem Cell 
13:541) . Pooled cransf ectants with an empty expression 
vector or the del-1 construct were selected in G418 . 



Lysates, cell culture supernatants and extracellular matrix 
were prepared from cransfected cells, and reacted with an 
affinity-purified rabbit antiserum in Western blots. The 
polyclonal antiserum was generated to recombinant Del - 1 

5 fusion protein expressed in bacteria. Figure 14A shows that 
a band of 52,000 daltons molecular weight was recognized in 
cell lysates prepared by harvesting the cells in lysis or 
standard Laemmli gel loading buffer, and in extracellular 
matrix. This band corresponds with the predicted molecular 

0 weight for Del-1 based on the deduced amino acid sequence, 
and represented the full-length Del-1 protein. In contrast, 
no protein was identified with culture supernatants harvested 
from the transf ectancs , even when concentrated 100 -fold. 
Additionally, smaller proteolytic fragments were also 

5 detected. These results indicate that Del-1 is secreted 

across the surface of endothelial cells, and deposited in the 
extracellular matrix. 

Several stably transfected yolk sac cell clones with the 
del-1 gene were selected (Figure 143) . When the transfected 

0 cells were reacted with the aforementioned antibody, both the 
membrane of certain yolk sac ceils and the extracellular 
matrix were stained as compared with mock- transf ected yolk 
sac cells as negative control (Figure ISA , 15B) . In keeping 
with this staining pattern, immunos taining of developing bone 

5 of a 13.5 day mouse embryo detected the Del-1 protein in the 
laquanae within the bone, which were composed of 
extracellular matrix proteins (Figure 16) 

In order to test the expression of del-1 in tumor cells 
by immunohistochemistry , human glioma cells were implanted in 

0 nude mice. The tumor was isolated, sectioned and stained 
with the aforementioned antibody followed by an anti-rabbit 
antibody conjugated with horse radish oeroxidase and 
developed with Sigma Fast Red substitute. Figure 17A shows 
that the in vivo tumor cells were stained with the antibody 

5 in a polarized fashion. Polarization of del-1 expression in 
tumor cells might have resulted from the interaction of the 
gene product with cellular receptors on adjacent cells. In 



addition, a blood vessel of mouse origin traversing the human 
tumor was also stained with the antibody (Figure 17B) 



9 * EXAMPLE : DEL-1 INHIBITS VASCULAR FORMATION 
5 9.1 MATERIALS AND METHODS 

9-1.1. ANGIOGENICS IS ASSAYS 
In vitro angiogenesis assays on " MATRIGEL M (Biocoat, 
Becton Dickinson) were conducted in 24 well plates coated 
with 50 /il of "MATRIGEL" . del -I transf ectant s and control 
0 transf ectants were plated at a density of 5xl0 4 cells/well 

(low density) or 2xl0 5 cells/well (high density) , and observed 
for seven days . 

For the assay evaluating morphogenet ic potential of wild 
type yolk sac cells on del -I conditioned matrix, the matrix 

5 was generated by growing 10* del-1 transf ectancs in 6 cm 

dishes for 7 days . A control matrix was generated by growing 
control transf ectants under identical conditions. 
Transf ected cells were removed with 0.5 M EDTA and extensive 
washing, and 10 5 wild type yolk sac ceils were plated on the 

) matrix produced by the del-1 or the control transf ectant s . 
Cells were cultured and observed for seven days. 

For the in vitro angiogenesis sprouting assay, del-1 and 
control transf ectants were trypsinized, and 10 5 cells cultured 
m a 15 ml conical tubes for 43 hours. Cell cultures were 

> then transferred into a bacterial petri dish, and cultured 
ror 4-7 days. Under these conditions, cell aggregates were 
formed. Several aggregates were collected for del-1 and 
control transf ectants , and these were transferred to 24 well 
plates coated with » Matrigel » . Sprouting angiogenesis was 
evaluated at 24 and 48 hours. 

9.2. RESULTS 

The yolk sac cell line, YS-3, was chosen as the parental 
cell for del-1 transf ection because it had characteristics of 
embryonic endothelial cells, did not express del-1, was 
clonal and long lived in culture (Figure 18A) . Most 
importantly, these cells provided a model of vascularization 



of the early yolk sac. While they were easily grown and 
maintained with frequent: passage, when allowed to accumulate 
to high density they spontaneously formed vascular 
structures. This process was accelerated when the cells were 
5 plated on the basement membrane-like material " MATRIGEL" , on 
which they behaved similar to various types of cultured 
endothelial cells (Figure 183) . Cell lines transfected with 
the cDNA encoding of the major form of del-1 were selected 
for varying levels of expression of the transfected construct 
10 (Figure 14B) . Cell lines transfected with the empty 
expression plasmid were selected to serve as negative 
controls . 

The del-1 transfected yolk sac clones and mock- 
transfected yolk sac lines were compared for their ability to 

15 form branching vascular- like structures on "MATRIGEL" . After 
24 hours on "MATRIGEL", the negative control transf ectants 
had established an intricate network typical for these cells 
(Figure 18C) . Cells (L10) expressing high levels of del-1 
showed a markedly different pattern, assembling into multiple 

20 well-spaced clusters (Figure 18D) . This abrogation of 
morphogenesis was directly related to the level of del-1 
expression, as low del-1 expressing clones, L13 and L14 , 
showed some degree of branching morphology. 

Since Del-i protein is deposited in the extracellular 

25 matrix, one del -I expressing clone, L10, was used to generate 
a cell culture matrix containing Del-1 protein. Matrix 
generated by negative control clones should differ only by 
the absence of Del-1. Transfected and control lines were 
cultured for 7 days, and then gently removed from the culture 

3 0 dish by extensive washing with 1 mM EDTA . 3y visual 
inspection, only a rare cell was not removed with this 
technique. Non- transf ected native yolk sac cells were then 
plated on the Del - 1 -containing and the control matrices, and 
scored for their ability to assemble into a network. The 

35 yolk sac ceils required several days at high density to 
undergo morphogenesis, and the network was lace -like in 
appearance. Cells grown on the matrix produced by negative 
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control transfectants were able to uroduce the network 
(Figure 18E} . In contrast, yolk sac cells grown on matrix 
containing Del - 1 revealed no evidence of morphogenesis. They 
formed instead a dense monolayer (Figure 18F) . 
5 Next, an in vitro angiogenesis sprouting assay was 

employed with the transfected yolk sac lines. This assay has 
been employed to evaluate angiogenic potential (Pepper at al . 
1991, J. Cell. Physiol. 146:170). Transfected cells were 
allowed to stand overnight in a conical cube to allow them to 
10 aggregate, and the cell mass was then placed on "MATRIGEL" . 
The ability of the del-1 expressing cells to migrate onto the 
"MATRIGEL 11 and assemble into branching structures was 
compared to control cells. Within 24 hours, the control 
cells formed a series of branching projections, while the 
15 cells expressing del -1 remained virtually confined to the 
cellular aggregate (Figure 18G and 18K) . While there was 
some evidence of spreading of the del - 1 expressing cells 
after 4 8 hours, it was more as a sheet rather than a 
sprouting structure . 
2 0 Hence, Del-1 inhibits vascular morphogenesis and may be 

used to regulate endothelial cell differentiation. 

10 • EXAMPLE: DEL-1 BINDS TO INTEGRIN ALPHA V BETA 3 
10.1. MATERIALS AND METHODS 

10.1.1. RECOMBINANT DEL-1 PURIFICATION AND 

REFOLDING 

Recombinant murine Del-1 protein (major form) was 
prepared using an E . coli expression system and protein 
refolding technique. E . coli cells with the del-1 containing 
pET28a vector (Novagen Inc.) were grown and induced following 
the protocol recommended by the manufacturer. Approximately 
50 to 100 mg of crude recombinant Dei -I were routinely 
produced from 1L of bacterial culture in the form of 
insoluble cytoplasmic inclusion bodies . Inclusion bodies 
were isolated by sonication of the E . coli cells, 
cencrif ligation and collection of the pellet fraction. 
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Inclusion bodies from 500 nl of culture were then washed 
three rimes with 50 ml of 2M Urea, 0.025 M Tris-Cl (pK8.0), 
0.025% Triton X100. This procedure yielded a crude, 
insoluble, Del-1 product of > 80% purity. 
S Recombinant Del-1 was dissolved by suspending the pellet 

from 500 ml of culture in 2.5 ml of 8M Urea, 100 mM DTT, 
0.1 M Tris-Cl (pH8.0), 0.05% Triton X100, followed by 
incubation at room temperature for 1 hr . Insoluble material 
remaining was removed by centrif ugat ion and the soluble 

10 supernatant fraction was diluted 10 fold to 25 ml with 8M 
Urea, 100 mM Tris-Cl (pH 8.0), 0.05% Triton X100. Protein 
concentration was then measured by Bradford protein 
determination assay. 

Soluble, reduced Del-1 was refolded by diluting to a 

15 final concentration of 0.01 mg/ml into refolding buffer: 
100 mM Tris-Cl (pHS.O), 100 mM (NH,) 2 SO., 2 mM reduced 
glutathione, 0 . 5 mM oxidized glutathione, 0.05% sodium azide, 
0.025 mg/ml PMSF . Refolding was performed by incubating this 
reaction mix at 4°C for one week. Refolded Del-1 was then 

2 0 concentrated using an Amicon spiral concentrator and the 
soluble material remaining was collected. 

The recombinant Del-1 product produced from the pET28a 
expression vector is a fusion protein with both N-terminal 
and C-terminal polyhist idine tags. This product was purified 

25 using the Novagen His tag resin purification system, 
following the protocol recommended by the supplier. 

Refolded murine recombinant Del-1 was soluble and stable 
when stored at 4 °C in Tris-Cl buffer with 100 mM (NH, ) 7 S0. at 
concentrations of less than or equal to 100 mg/ml. 

30 10.1.2. CELL ADHESION ASSAYS 

Human umbilical vein endothelial cells (HUVEC) 
(Clonetics Inc., San Diego, CA) were grown as indicated by 
the supplier in endothelial growth media supplemented with 
10 ng/ml human recombinant epidermal growth factor, 1 ug /ml 

35 hydrocortisone, 50 ug/ml gentamicin, 12 ug/ml bovine brain 
extract and 2% F3S . Cells were grown at 37°C/5% CO, to 70% 
confluency before use in the binding assay. Non- tissue 

- 67 - 



culture treated 96 well plates were coated with appropriate 
levels of target protein (1-20 fig of either murine 
recombinant Del-1, vitronectin, or BSA) diluted in calcium 
and magnesium free PBS for 24 hrs at 4°C. The plates were 

5 washed with PBS and blocked for 3 0 min with a solution of 
heat treated (95°C for 5 min) PBS containing 3% BSA. HUVEC 
cells were harvested by trypsinization and resuspended in an 
adhesion buffer (Hanks balanced salt solution pH 7.4 
containing lOmM Kepes, 2 . 2 mM MgCl 2 , 2 mM Cacl 2/ 0 . 2mM MnCl 2 

.0 and 1% BSA) . Cells (lOVlOO /xl) were added to each well in 
the presence or absence of the indicated antagonists or 
controls at varying concentrations. Antagonists included 
anti-human ctV/33 (clone LM609, Chemicon Inc.), RGB peptides 
(the inactive control GRGESP) or RGD the stable antagonist 

5 GPenGRGDSPCA or GRGDdSP all from Gibco) . Cells were 

incubated at 37°C/5% C0 2 for 60-90 min and wells were washed 
until no cells remained in the BSA control. To count 
remaining cells, 100 fil of endothelial media was added to 
each well. Cells number was determined by the Promega Cell 

0 titer AQ as indicated by the manufacturer. 

10.2. RESULTS 

Recombinant Del-1 protein and del-1 t ransf ectant s bound 
HUVEC . In order to identify a cellular receptor on HUVEC for 

5 Del-1, various peptides and antibodies were used to inhibit 
the interactions between Del-1 and HUVEC in cell adhesion 
assays. Figure 19 shows that an anti-aV/33 antibody 
specifically inhibited recombinant Del-1 binding to HUVEC. 
In contrast, anti-cxVj35 did not inhibit, nor did the control 

0 Ig. Furthermore, an RGD peptide was also shown to inhibit 
Del-1 binding to HUVEC (Figure 20) . Similar results were 
obtained using extracellular matrix obtained from del-1 
transfected cells. Therefore, Del-1 binds to aVB 3 expressed 
by HUVEC, possibly via RGD in its second EGr-like domain. 

5 aV/?3 is an integrin expressed by certain cell types and 

13 associated with bFGF- induced angiogenic endothelial cells. 
Agents that bind to this integrin induce aoootosis of 



angiogenic endothelial cells. Since Del-1 binds to this 
integrin, it may be used to induce apoptosis during 
angiogenesis in tumor sites zo reduce tumor growth. 

5 11 • EXAMPLE: CHROMOSOMAL LOCALIZATION OF HUMAN DEL-1 

DNA from PI clone 10043 was labeled with digoxigenin 
dUTP by nick translation. The labeled probe was combined 
with sheared human DNA and hybridized to normal metaphase 
chromosomes derived from PHA stimulated peripheral blood 
10 lymphocytes in a solution containing 50% formamide, 10% 

dextran sulfate and 2X SSC. Specific hybridization signals 
were detected by incubating the hybridized slides in 
f luoresceinated antidigoxigenin antibodies followed by 
counterstaining with DAPI . The initial experiment resulted 
15 in specific labeling of the long arm of a group B chromosome. 

A second experiment was conducted in which a probe that 
had previously been mapped to 5q34, and confirmed by 
cohybridization with a probe from the cri du chat locus which 
is known to localize to 5pl5, was cohybridized with clone 
20 10043. This experiment resulted in the specific labeling of 
the mid and distal long arm of chromosome 5 (Figure 21 A and 
B) . 'Measurements of 10 specifically hybridized chromosomes 5 
demonstrated that clone 10043 was located at a position which 
was 23% of the distance form the centromere to the telomere 
25 of chromosome arm 5q, an area that corresponded to band 5ql4 . 
A total of 8 0 metaphase cells were analyzed with 74 
exhibiting specific labeling. This region of the chromosome 
has been found to be a break point in some human cancers 
(Wieland and Bohm, 1994, Cancer Res. 54:1772; Fong et al . , 
30 1995, Cancer Res. 55:220; Wieland et al . , 1996, 12:97, 

Oncogene 12:97). Thus, chromosome 5 aberrations may lead to 
altered expression of del-1 and contribute to the malignant 
p he no type . 
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12 . DEPOSIT OF MICROORGANISMS 

The following organisms were deposited with the American 
Type Culture Collection (ATCC) , 12301 Park I awn Drive, 
Rockville, Maryland 20852. 

5 

Strain Designation Accession No. 

Hu DEL-1.Z1 ATCC 97155 

Hu DEL-1.Z20 ATCC 97154 

mus DEL-1.1 ATCC 97196 

10 mus DEL-1.18 ATCC 97197 

The present invention is noc to be limited in scope by 
the exemplified embodiments or deposited organisms which are 
intended as illustrations of single aspects of the invention, 
and any clones, DNA or amino acid sequences which are 

15 functionally equivalent are within the scope of the 

invention. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to 
those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to 

20 fall within the scope of the appended claims. It is also to 
be understood that all base oair sizes civen for nucleotides 
are approximate and are used for purposes of description. 

All publications cited herein are incorporated by 
reference in their entirety. 

25 



30 



35 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Quertermous, Thomas 

Kogan, 3rigid 
Snodgrass, K . Ralph 
Zupancic, Thomas J. 

(ii) TITLE OF INVENTION: DEVELOPMENT ALLY -REGULATED ENDOTHELIAL 
CELL LOCUS - 1 

(iii) NUMBER OF SEQUENCES: 29 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 10035-2711 



■ (v) COMPUTER READABLE FORM: 

;j! (A) MEDIUM TYPE: Floppy disk 

133 (B) COMPUTER: I3M PC compatible 

\A (C) OPERATING SYSTEM : PC-DOS /MS -DOS 

(D) SOFTWARE: Patently Release =1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To 3e Assigned 

(B) FILING DATE: 05- JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Poissant, Briar. M. 

(3) REGISTRATION NUMBER: 23,462 

(C) REFERENCE /DOCKET NUM3ER : 8907-034 



(ix) TELECOMMUNICATION INFORMATION : 
(A) TELEPHONE: (212) 790-9090 
(3) TELEFAX : (212) 85 9-8 354/9741 
(C) TELEX: 65141 Pennie 



(2) INFORMATION FOR SSQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(3) TYPE: amino acid 

(C) STRAND EDNESS : 

(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE : oeotlde 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 1 : 

Asp Leu Leu Val Pro Thr Lys Val Thr Gly lie lie Thr Gin Gly Xaa 
1 5 ' io" 15* 
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Xaa Ala Lvs Asp Phe Gly As-q Val Leu ?he Val Glv Ser Tvr Lvs Leu 

20 ~ 25 30 

Ala Tyr Ser Asn Asp Gly Glu His Tid Met Val His Gin Aso Giu Lys 
3 5 ~ 40 45 

Gin Arg Lys Asp Lys Val Phe Gin Gly Asn Phe Asr> Asn Asp Thr His 
50 55 ' 50* 

Arg Lys Asn Val lie Asp Pro Pro lie Tyr Ala Arg Phe lie Arg lie 
65 70 75 80 

Leu Pro Leu Xaa Xaa 

85 

INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(3) TYPE: amino acid 

(C) STRAND ED NESS : 

{ D ) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Leu Gly Ser Ser Lys Glu Val Thr Gly lie lie Thr Gin Gly Xaa 
1 5 10 15 

Xaa Ala Arg Asn Phe Gly Ser Val Gin Phe Val Ala Ser Tvr Lys Val 

20 25 30 

Ala Tyr Ser Asn Asp Ser Ala Asn Trp Thr Glu Tyr Gin Asp Pro Arg 
35 40 " ' 45 

Thr Gly Ser Ser Lys Val Phe Gin Gly Asn Leu Asr> Asn Asn Ser His 
50 55 50* 

Lys Lys Asn lie Phe Glu Lys Pro Phe Met: Ala Arg Tyr Val Arg Val 
65 70 75 ' 80 

Leu Pro Val Xaa Xaa 

85 

INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: oe^tide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



Asp Leu Leu Lys lie Lys Lys lie Thr Ala lie lie Thr Gin Gly Xaa 
1 *" 5 10 15 



Xaa Cys Lys Ser Leu Ser Ser Glu Met Tyr Val Lys Ser Tvr Thr lie 

20 2 5 ~ ' 30 

His Tyr Ser Glu Gin Gly Val Glu Tro Lys Pro Tyr Arg Leu Lys Ser 
35 40 ' * 45 

Ser Met Val Asp Lys lie Phe Glu Gly Asn Thr Asn Thr Lys Gly His 
50 55 "* 60 

Val Lys Asn Phe Phe Asn Pro Pro lie lie Ser Arg Phe lie Arg Val 
65 70 75 SO 

lie Pro Lys Xaa Xaa 

85 

INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asp Leu Gin Lys Thr Met Lys Val Thr Gly lie He Thr Gin Gly Xaa 
1 5 ' 10 15 

Xaa Val Lys Ser Leu Phe Thr Ser Met Phe Val Lys Glu Phe Leu lie 

20 2 5 30 

Ser Ser Ser Gin Asp Gly His His Tro Thr Xaa Xaa Gin He Leu Tyr 
3 5 40 45 

Asn Gly Lys Val Lys Val Phe Gin Gly Asn Gin Aso Ser Ser Thr Pro 
50 5 5 * 60~ 

Met Met Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Xaa Xaa Xaa Xaa 
65 70 75 ~ 80 

Xaa Xaa Xaa Xaa Xaa 

85 

INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: oeotide 



I 

1 



t 

J 



(XX) SEQUENCE DESCRIPTION; SEQ ID NO : 5 : 

Asp Leu Glu Asn Leu Arg ?he Val Ser Gly lie Giy Thr Gin Gly Ala 
1 5 ~ icf ' 15" 

lie Ser Lys Glu Thr Lys Lvs Lys Tyr ?he Val Lvs Ser Tyr Lys Val 

20 25 * 30 

Asd lie Ser Ser Asn Giv Glu Aso Tro lie Xaa Xaa Thr Leu Lys Glv 
35 40 45 

Asp Asn Lys His Leu Val Phe Thr Glv Asn Thr Asp Ala Thr Asp Val 
50 * 55 ' 60 

Val Tyr Arg Pro Phe Ser Lys Pro Val lie Thr Arg Phe Val Arg Leu 
65 70 75 SO 

rl Arg ?ro Val Thr Trp 

'S 85" 

\}\ (2) INFORMATION FOR SEQ ID NO : 6 : 

^.j (i) SEQUENCE CHARACTERISTICS: 
.fl (A) LENGTH: 85 amino acids 

;J (B) TYPE: amino acid 

*** (C) STRANDEDNESS: J 

H (D ) TOPOLOGY: unknown 

5: 

(ii) MOLECULE TYPE: peotide 



1 



H \ 

it - 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Leu Ala Glu Glu Lvs lie Val Arg Glv Val lie lie Gin Glv Xaa 
1 5 10^ 15' 

Xaa Gly Lys His Lys Glu Asn Lys Val Phe Met Arg Lys Phe Lys lie 

20 2 5 " ' 30 

Gly Tyr Ser Asn Asn Gly Thr Glu Trp Glu Mer lie Her Asr> Ser Ser 
35 40 45 

Lys Asn Lys Pro Lys Thr Phe Glu Gly Asn Thr Asn Tyr Aso Thr Pro 
50 55 60 

Glu Leu Arg Thr Phe Xaa Ala His lie Thr Thr Gly Phe lie Arg lie 
65 70 75 80 

lie Pro Xaa Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: oeotide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Gly Cys Glu Val Pro Arg Thr Phe Met: Cvs Val Ala Leu Gin Glv Xaa 
1 5 10 15~ 

Xaa Xaa Arg Gly Xaa Asp Ala Asp Gin Trp Val Thr Ser Tyr Lys lie 

20 " 25 30 

Arg Tyr Ser Leu Asp Asn Val Ser Tro Phe Xaa Xaa Xaa Xaa Xaa Glu 1 
3 5 40 45 

Tyr Arg Asn Gly Ala Ala lie Thr Gly Val Thr Asp Arg Asn Thr Val 
50 55 S0~ 

Val Asn His Phe Phe Asp Thr Pro lie Arg Ala Arg Ser lie Ala lie 
65 70 7 5 80 

Kis Pro Leu Thr Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids , 
. . ... ' i 



(B) TYPE: amino acid 

(C) STRANDED NESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



H* (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Asp Leu Xaa Xaa Xaa Xaa Xaa Val Thr Gly lie lie Thr Gin Glv Xaa 
15 10 15^ 

Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa Phe Val Xaa Ser Tvr Lys lie 

2 0 2 5 30 

Xaa Tyr Ser Xaa Asp Gly Xaa Xaa Trr> Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 " 45 

Xaa Xaa Lys Xaa Lys Val Phe Xaa Gly Asn Thr Aso Xaa Xaa Thr Xaa 
50 55 S0~ 

Xaa Xaa Asn Xaa Phe Xaa Xaa Pro lie Xaa Xaa Arg Phe lie Arg Xaa 
65 70 75 80 

Xaa Pro Xaa Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 9 : 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2303 base oairs 
(3) TYPE: nucleic acid" 

(C) STRANDEDNSSS : single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 619.. 2058 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 





GAATTCCGGT 


TAACTGAGGA 


CAAAGGGTAA 


TGCAGAAGTG 


ATATTTGATT 


TCCATTCTCA 


60 




TTCCCAGTGG 


CCTTGATATT 


TAAACTGATT 


CCTGCCACCA 


GGTCCTTGGG 


CCACCCTGTC 


120 




CCTGCGTCTC 


ATATTTCTGC 


ATGCTGCTTT 


GTTTGTATAT 


AGTGCGCTCC 


TGGCCTCAGG 


180 




CTCGCTCCCC 


TCCAGCTCTC 


GCTTCATTGT 


TCTCCAAGTC 


AGAAGCCCCC 


GCATCCGCCG 


240 


£ : 
iZ'i 


CGCAGCAGCG 


TGAGCCGTAG 


TCACTGCTGG 


CCGCTTCGCC 


TGCGTGCGCG 


CACGGAAATC 


300 


r '-ss 
s _ : 


GGGGAGCCAG 


GAACCCAAGG 


AGCCGCCGTC 


CGCCCGCTGT 


GCCTCTGCTA 


GACCACTCGC 


360 


Ui 

J? 
"it 


AGCCCCAGCC 


TCTCTCAAGC 


GCACCCACCT 


CCGCGCACCC 


CAGCTCAGGC 


GAAGCTGGAG 


420 


** 


TGAGGGTGAA 


TCACCCTTTC 


TCTAGGGCCA 


CCACTCTTTT 


ATCGCCCTTC 


CCAAGATTTG 


480 


JSPS 


AGAAGCGCTG 


CGGGAGGAAA 


GACGTCCTCT 


TGATCTCTGA 


CAGGGCGGGG 


TTTACTGCTG 


540 


*: 


TCCTGCAGGC 


GCGCCTCGCC 


TACTGTGCCC 


TCCGCTACGA 


CCCCGGACCA 


GCCCAGGTCA 


600 



CGTCCGTGAG AAGGGATC ATG AAG CAC TTG GTA GCA GCC TGG CTT TTG GTT 651 

Met Lys His Leu Val Ala Ala Trr> Leu Leu Val 
*« 1 5 10 



GGA CTC AGC CTC GGG GTG CCC CAG TTC GGC AAA GGT GAC ATT TGC AAC 69 9 

Gly Leu Ser Leu Gly Val Pro Gin Phe Giv Lvs Giv Aso lie Cys Asn 

15 20 " " 25 

CCG AAC CCC TGT GAA AAT GGT GGC ATC TGT CTG TCA GGA CTG GCT GAT 74 7 

Pro Asn Pro Cys Glu Asn Gly Gly lie Cys Leu Ser Gly Leu Aso 
30 35 40 

GAT TCC TTT TCC TGT GAG TGT CCA GAA GGC TTC GCA GGT CCG AAC TGC 795 
Asp Ser Phe Ser Cys Glu Cys Pro Glu Giv Phe Ala Gly ?-o Asn Cvs 
45 50 ~ 55 

TCT AGT GTT GTG GAG GTT GCA TCA GAT GAA GAA AAG CCT ACT TC A GCA 84 3 

Ser Ser Val Val Glu Val Ala Ser Aso Glu Glu Lys Pro Thr Ser Ala 
60 65 70 75 

GGT CCC TGC ATC CCT AAC CCA TGC CAT AAC GGA GGA ACC TG~ GAG ATA 
Gly Pro Cys He Pro Asn Pro Cys His Asn Giv Giv Thr Cvs Glu He 

80 85 90 

AGC GAA GCC TAT CGA GGA GAC AC A TTC ATA GGC TAT C-TT vqt AAA t G T 
Ser Glu Ala Tyr Arg Gly Asp Thr Phe He Giv Tvr Val Cvs Lvs Cvs 

CCT CGG GGA TTT AAT GGG ATT CAC TGT CAG CAC AAT ATA AAT GAA t GT 
Pro Arg Gly ?he Asn Gly He His Cys Gin His Asn He Asn Glu Cvs 
110 H5 120 

GAA GCT GAG CCT TGC AGA AAT GGC GGA ATA TGT ACC GAC CTT GTT GCT 1035 



891 



939 



987 
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Glu Ala Glu 
125 

AAC TAC TCT 
Asn Tvr Ser 
140 

TAT AAA TGC 
Tyr Lys Cys 



CAG CAA ATC 
Gin Gin lie 



AAG TGG TAT 
Lys Trt) Tyr 
190 

* 

5 t 

f ;:: Mt 

i|] GCC TGG ACA 

m Ala Trp Thr 

1=1 205 

^1 " TTG CAA AGA 

fl3 Leu Gin Arg 

|S 220 

AGG ATT GGA 
•' : Arg lie Gly 



AAT GAC GGG 
Asn Asp Gly 



GAG ATG GTC 
Glu Met Val 
270 

TCT TTC ACA 
Ser Phe Thr 
285 

ATT TGT CGA 
lie Cys Arg 
300 

CTC TCA GGC 
Leu Ser Gly 



GAC TAC CAG 
Asp Tyr Gin 



ATG TTT ACT 
Met: Phe Thr 
350 

GTA AAT GCC 
Val Asn Ala 
365 



Pro Cys Arg Asn Gly 

130 

TGT GAA TGC CCA GGA 
Cys Glu Cys Pro Gly 
145 

TCT GGG CAC TTG GGA 
Ser Gly His Leu Gly 
160 

ACA GCT TCA TCT AAT 
Thr Ala Ser Ser Asn 
175 

CCC TAC TAT GCT CGA 
Pro Tyr Tvr Ala Arg 

19 5 

GCT GCT GAA AAT GAC 
Ala Ala Glu Asn Aso 

210 

AAA ATG AGA GTC ACT 
Lys Met Arg Val Thr 
225 

AGC CCA GAG TAC ATA 
Ser Pro Glu Tvr lie 
240 

AAG ACC TGG GCA ATG 
Lys Thr Tro Ala Met 
255 

TTT CGT GGA AAT GTT 
Phe Arg Glv Asn Val 

275 

CCC CCA ATC AAA GCT 
Pro Pro lie Lys Ala 

290 

AGG CAT TGT ACT TTA 
Arg His Cys Thr Leu 
305 

TGT TCA GAA CCT TTG 
Cvs Ser Glu Pro Leu 
320 

ATC ACT GCC TCC AGC 
lie Thr Ala Ser Ser 
335 

TGG GAA CCA AGG AAA 
Trp Glu Pro Arg Lys 

TGG ACT TCC GGC CAT 
Tro Thr Ser Glv His 

370 



Gly lie Cvs Thr Asp 

13 5 

GAA TTT ATG GGA CGA 
Glu Phe Met Gly Arg 
150 

ATC GAA GGT GGG ATC 
lie Glu Gly Gly lie 
165 

CAC CGA GCT CTT TTT 
Kis Arg Ala Leu Phe 
180 

CTT AAT AAG AAG GGC 
Leu Asn Lys Lys Gly 

200 

AGA TGG CCA TGG ATT 
Arg Trp Pro Trp lie 

215 

GGT GTT ATT ACC CAA 
Glv Val lie Thr Gin 
230 

AAA TCC TAC AAA ATT 
Lys Ser Tyr Lys lie 
245 

TAC AAA GTA AAA GGC 
Tvr Lys Val Lys Glv 
2 60 

GAT AAC AAC ACA CCA 
Aso Asn Asn Thr Pro 

280 

CAG TAT GTA AGA CTC 
Gin Tyr Val Arg Leu 

295 

AGA ATG GAA CTT CTT 
Arg Met Glu Leu Leu 
310 

GGG ATG AAA TCA GGG 
Glv Met Lvs Ser Glv 
32 5 

GTC TTC AGA ACA CTC 
Val Phe Arg Thr Leu 
340 

GCC AGG CTG GAC AAG 

Ala Arg Leu A so Lvs 

350 

AAC GAC CAG TCA CAA 
Asn Asp Gin Ser Gin 

375 



Leu Vai Ala 



AAT TGT CAA 1083 
Asn Cys Gin 
155 

ATA TCT AAT 1131 
lie Ser Asn 
170 

GGA CTC CAG 1179 

Gly Leu Gin 

185 

CTT ATA AAT 1227 
Leu lie Asn 



CAG ATA AAT 1275 
Gin lie Asn 



GGA GCA AAA 1323 
Gly Ala Lvs 
23 5 

GCC TAC AGC 1371 
Ala Tyr Ser 
250 

ACC AAT GAA 1419 

Thr Asn Glu 

265 

TAT GCT AAT 14 6 7 

Tyr Ala Asn 

TAC CCC CAA 1515 
Tyr Pro Gin 



GGC TGT GAG 1563 
Gly Cys Glu 
315 

CAT ATA CAA 1611 
His He Gin 
330 

AAC ATG GAC 1659 

Asn Met Aso 

345 

CAA GGC AAA 17 07 

Gin- Glv Lvs 

TGG TTA CAG 17 5 5 

Trp Leu Gin 
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Asn 

Gly 

Gly 

Arg 

Cys 
145 

His 

□ Ser 

V'i \ 

' "i 

#J3 Glu 

••<< 
: = 

r "" Arg 
La Glu 
fj] Trp 

r f m 

r.E 

? ' J Gly 

lie 

Cys 
305 

Glu 

Ala 

Pro 

Ser 

Pro 
385 

His 



Pro Cys His 



Asp Thr Phe 
100 

lie His Cys 
115 

Asn Gly Gly 
130 

Pro Gly Glu 



Leu Gly lie 



Ser Asn His 
180 

Ala Arg Leu 
195 

Asn Asd Arg 
210 

Val Thr Gly 



Tyr lie Lys 



Ala Met Tyr 
260 

Asn Val Asp 
275 

Lys Ala Gin 
290 

Thr Leu Arg 



Pro Leu Gly 



Ser Ser Val 
340 

Arg Lys Ala 
3 5 5 

Glv His Asn 
370 

Thr Lys Val 



Val Gin Phe 



Asn Glv Gly 
85 

lie Gly Tyr 



Gin His Asn 



lie Cys Thr 
135 

Phe Met Gly 
150 

Glu Gly Gly 
165 

Arg Ala Leu 



Asn Lys Lys 



Trp Pro Trp 
215 

Val lie Thr 
230 

Ser Tvr Lys 
245 

Lys Val Lys 



Asn Asn Thr 



Tyr Val Arg 
295 

Met Glu Leu 
310 

Met Lys Ser 
32 5 

Phe Arg Thr 



Arg Leu Asp 



Asp Gin Ser 
375 

Thr Glv lie 
390 

Val Gly Ser 
405 



Thr Cys Glu 
90 

Val Cys Lys 
105 

lie Asn Glu 
120 

Asp Leu Val 



Arg Asn Cys 



lie lie Ser 
170 

Phe Glv Leu 
185 

Gly Leu lie 
200 

lie Gin lie 



Gin Gly Ala 



lie Ala Tyr 
250 

Gly Thr Asn 
265 

Pro Tvr Ala 
280 

Leu Tyr Pro 



Leu Gly Cys 



Gly His lie 
330 

Leu Asn Met 
345 

Lys Gin Glv 
360 

Gin Trp Leu 



lie Thr Gin 



Tyr Lys Leu 
410 



lie Ser Glu 



Cys Pro Arg 



Cys Glu Ala 
125 

Ala Asn Tyr 
140 

Gin Tyr Lys 
155 

Asn Gin Gin 



Gin Lys Trp 



Asn Ala TrD 
205 

Asn Leu Gin 
220 

Lvs Arg lie 
235 

Ser Asn Asp 



Glu Glu Met 



Asn Ser Phe 
235 

Gin He Cvs 
300 

Glu Leu Ser 
315 

Gin Asp Tyr 



Asp Met Phe 



Lvs Val Asn 
365 

Gin Val Asd 
380 

Glv Ala Lvs 
395 

Ala Tyr Ser 



Ala Tvr Arg 
95 

Glv Phe Asn 
110 

Glu Pro Cys 



Ser Cys Glu 



Cys Ser Glv 
160 

He Thr Ala 
175 

Tyr Pro Tyr 
190 

Thr Ala Ala 



Arg Lys Met 



Gly Ser Pro 
240 

Gly Lys Thr 
255 

Val Phe Arg 
2 70 

Thr Pro Pro 



Arg Arg His 



Gly Cys Ser 
320 

Gin He Thr 
335 

Thr Tro Glu 
350 

Ala Trr> Thr 



Leu Leu Val 



Asp Phe Glv 
400 

Asn Asd Glv 
415 
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Glu His Trp Met Val His Gin Asp Giu Lvs Gin Arg Lys Asp Lys Vai 

420 42 5 " 430 

Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg Lvs Asn Val lie Asp 
435 440 445 

Pro Pro lie Tyr Ala Arg Phe lie Arg lie Leu Pro Trp Ser Trp Tyr 

450 45 5 460 

Gly Arg lie Thr Leu Arg Ser Glu Leu Leu Gly Cys Ala Giu Glu Glu 
465 470 475 480 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : cDNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 1779 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCT CTT TAG TCA CCA CTC TCG CCC TCT CCA AGA ATT TGT TTA ACA AAG 

Ser Leu "* Ser Pro Leu Ser Pro Ser Pro Arg lie Cys Leu Thr Lys 
1 5 10 " 15 

CGC TGA GGA AAG AGA ACG TCT TCT TGA ATT CTT TAG TAG GGG CGG AGT 

Arg * Gly Lys Arg Thr Ser Ser * He Leu * * Glv Arg Ser 

20 2 5 30 

CTG CTG CTG CCC TGC GCT GCC ACC TCG GCT ACA CTG CCC TCC GCG ACG 

Leu Leu Leu Pro Cys Ala Ala Thr Ser Ala Thr Leu Pro Ser Ala Thr 
35 40 45 

ACC CCT GAC CAG CCG GGG TCA CGT CCG GGA GAC GGG ATC ATG AAG CGC 

Thr Pro Asp Gin Pro Gly Ser Arg Pro Gly Asp Glv He Met Lvs Arg 
50 55 ' ' " 60 

TCG GTA GCC GTC TGG CTC TTG GTC GGG CTC AGC CTC GGT GTC CCC CAG 

Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly Val Pro Gin 

65 70 75 80 

TTC GGC AAA GGT GAT ATT TGT GAT CCC AAT CCA TGT GAA AAT GGA GGT 

Phe Gly Lys Gly Asp He Cys Asp Pro Asn Pro Cvs Glu Asn Glv Glv 

85 90 " 95 

ATC TGT TTG CCA GGA TTG GCT GTA GGT TCC TTT TCC TGT GAG TGT CCA 

He Cys Leu Pro Gly Leu Ala Val Glv Ser Phe Ser Cvs Glu- Cvs Pro 

100 " 105 HO 

GAT GGC TTC ACA GAC CCC AAC TGT TCT AGT GTT GTG GAG GTT GCA TCA 

Asp Gly Phe Thr Asp Pro Asn Cvs Ser Ser Vai Val Glu Val Ala Ser 
115 " 120 125 
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GAT GAA GAA GAA CCA ACT TCA GCA GGT CCC TGC ACT CCT AAT CCA TGC 
Asp Glu Glu Glu Pro Thr Ser Ala Gly Pro Cys Thr Pro Asn Pro Cys 
130 135 140 



CAT AAT GGA GGA ACC TGT GAA ATA AGT GAA GCA TAC CGA GGG GAT ACA 

His Asn Gly Gly Thr Cys Glu lie Ser Glu Ala Tyr Arg Gly Asp Thr 

145 " 150 155 160 

TTC ATA GGC TAT GTT TGT AAA TGT CCC CGA GGA TTT AAT GGG ATT CAC 

Phe lie Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn Gly lie Kis 

165 - " 170 175 

TGT CAG CAC AAC ATA AAT GAA TGC GAA GTT GAG CCT TGC AAA AAT GGT 

Cys Gin His Asn lie Asn Glu Cys Glu Val Glu Pro Cys Lys Asn Gly 

180 ' 185 ~ 190 

GGA ATA TGT ACA GAT CTT GTT GCT AAC TAT TCC TGT GAG TGC CCA GGC 

Gly lie Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu Cys Pro Gly 

195 200 205 

GAA TTT ATG GGA AGA AAT TGT CAA TAC AAA TGC TCA GGC CCA CTG GGA 

Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys Cys Ser Gly Pro Leu Gly 

210 ' 215 ' " 220 

ATT GAA GGT GGA ATT ATA TCA AAC CAG CAA ATC ACA GCT TCC TCT ACT 

lie Glu Gly Gly lie lie Ser Asn Gin Gin lie Thr Ala Ser Ser Thr 

225 230 235 240 

CAC CGA GCT CTT TTT GGA CTC CAA AAA TGG TAT CCC TAC TAT GCA CGT 

His Arg Ala Leu Phe Gly Leu Gin Lys Tro Tyr Pro Tvr Tyr Ala Arg 

245 " "* 250 " ' 255 

CTT AAT AAG AAG GGG CTT ATA AAT GCG TGG ACA GCT GCA GAA AAT GAC 

Leu Asn Lys Lys Gly Leu lie Asn Ala Tro Thr Ala Ala Glu Asn Asp 

260 265 270 

AGA TGG AAG CGG TGG ATT CAG ATA AAT TTG CAA AGA AAA ATG AGA GTT 

Arg Trp Lys Arg Trp lie Gin lie Asn Leu Gin Arg Lvs Met Arg Val 

275 " " 280 235 

ACT GGT GTG ATT ACC CAA GGG GCC AAG AGG ATT GGA AGC CCA GAG TAT 
Thr Gly Val He Thr Gin Gly Ala Lys Arg Tie Gly Ser Pro Glu Tyr 

290 295 300 

ATA AAA TTC TAC AAA ATT GCC TAC AGT AAT GAT GGA AAG ACT TGG GCA 

He Lys Phe Tyr Lys He Ala Tvr Ser Asn Aso Gly Lvs Thr Trt> Ala 

305 310 315 " " 320 

ATG TAC AAA GTG AAA GGC ACC AAT GAA GAC ATG GTG TTT CGT GGA AAC 
Met. Tyr Lys Val Lys Gly Thr Asn Glu Aso Mer Val Phe Arg Gly Asn 

325 330 ~ 335 

ATT GAT AAC AAC ACT CCA TAT GCT AAC TCT TTC ACA CCC CCC ATA AAA 

He Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro He Lys 

340 345 350 

GCT CAG TAT GTA AGA CTC TAT CCC CAA GTT TGT CGA AGA CAT- TGC ACT 

Ala Gin Tyr Val Arg Leu Tvr Pro Gin Val Cvs Arg Arg His Cys Thr 

355 350 " 355 

TTG CGA ATG GAA CTT CTT GGC TGT GAA CTG TCG GGT TGT TCT GAG CCT 

Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Glv Cvs Ser Glu Pro 

370 375 " 330 
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CTG GGT ATG AAA TCA GGA CAT ATA CAA GAC TAT CAG ATC ACT GCC TCC 
Leu Gly Met Lys Ser Gly His lie Gin Asp Tyr Gin lie Thr Ala Ser 
385 ' " 390 395 400 



1200 



AGC ATC TTC AGA ACG CTC AAC ATG GAC ATG TTC ACT TGG GAA CCA AGG 
Ser lie Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp Glu Pro Arg 

405 " 410 415 



1248 



AAA GCT CGG CTG GAC AAG CAA GGC AAA GTG AAT GCC TGG ACC TCT GGC 
Lys Ala Arg Leu Asp Lys Gin Gly Lys Val Asn Ala Trp Thr Ser Gly 

420 " " 42 5 ~ 430 



1296 



CAC AAT GAC CAG TCA CAA TGG TTA CAG GTG GAT CTT CTT GTT CCA ACC 
His Asn Asp Gin Ser Gin Trp Leu Gin Val Asp Leu Leu Val Pro Thr 
435 440 " 445 



1344 



3 : I 



AAA GTG ACT GGC ATC ATT ACA CAA GGA GCT AAA GAT TTT GGT CAT GTA 

Lys Val Thr Gly lie lie Thr Gin Gly Ala Lys Asp Phe Gly His Val 

450 455 460 

CAG TTT GTT GGC TCC TAC AAA CTG GCT TAC AGC AAT GAT GGA GAA CAC 

Gin Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp Gly Glu His 

465 " 470 ' " 475 480 



1392 



1440 



TGG ACT GTA TAC CAG GAT GAA AAG CAA AGA AAA GAT AAG GTT TTC CAG 
Trp Thr Val Tyr Gin Asp Glu Lys Gin Arg Lys Asp Lys Val Phe Gin 

485 490 ' 495 



1488 



GGA AAT TTT GAC AAT GAC ACT CAC AGA AAA AAT GTC ATC GAC CCT CCC 
Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie Asp Pro Pro 

500 505 510 



1536 



ATC TAT GCA CGA CAC ATA AGA ATC CTT CCT TGG TCC TGG TAC GGG AGG 
lie Tyr Ala Arg Kis lie Arg lie Leu Pro Tro Ser Trn Tyr Giv Arg 
515 " 520 ~ 525 



1584 



ATC ACA TTG GCG TCA GAG CTG CTG GGC TGC ACA GAG GAG GAA TGA GGG 
lie Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu Glu * Gly 
530 53 5 * 540 



1632 



GAG GCT ACA TTT CAC AAC CGT CTT CCC TAT TTG GGT AAA AGT ATC TCC 
Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser lie Ser 
545 550 555 560 



1680 



ATG GAA TGA ACT GTG TAA AAT CTG TAG GAA ACT GAA TGG TTT TTT TTT 
Met Glu * Thr Val * Asn Leu * Glu Thr Glu Trp Phe Phe Phe 

565 570 ~ 575 



1728 



TTT TCA TGA AAA AGT GGT CAA ATT ATG GTA GGC AAC TAA CGG TGT TTT 
Phe Ser ~ Lys Ser Gly Gin lie Met Val Gly Asn * Arg Cys Phe 

580 585 590 



1776 



TAC C 
Tvr 



1780 



(2 ) _ INFORMATION FOR SSQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
( D } TOPOLOGY : 1 inear 
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(ii) MOLE CULS TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Leu * Ser Pro Leu Ser Pro Ser Pro Arg lie Cys Leu Thr Lvs 
1 5 10 ' 15 

Arg 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GIv Lys Arg Thr Ser Ser 
I 5 

(2) INFORMATION FOR SEQ ID NO: 14: 



M] (i) SEQUENCE CHARACTERISTICS: 

js* (A) LENGTH: 517 amino acids 

(3) TYPE: amino acid 

(D) TOPOLOGY: linear 

H (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

lie Leu * * Gly Arg Ser Leu Leu Leu Pro Cvs Ala Ala Thr Ser 
1 5 10 15 

Ala Thr Leu Pro Ser Ala Thr Thr Pro Asp Gin Pro Gly Ser Arg Pro 

20 2 5 " " 30 

Gly Asp Glv lie Met Lys Arg Ser Vai Ala Val Trn Leu Leu Val Gly 
35 40 * 45 

Leu Ser Leu Gly Val Pro Gin Phe Gly Lys Gly Asr> lie Cys Asn Pro 
50 55 " ' 50~ 

Asn Pro Cys Giu Asn Gly Gly lie Cys Leu Pro Gly Leu Ala Val Gly 
65 70 " 75 ~ 80 

Ser Phe Ser Cys Glu Cys Pro Asp Gly Phe Thr Asp Pro Asn Cys Ser 

S5 90 " 95 

Ser Vai Val Glu Val Ala Ser Asp Glu Giu Giu Pro Thr Ser Ala Gly 

100 105 110 

Pro Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cvs Giu He Ser 
115 ' 120 " 1-25 

Glu Ala Tyr Arg Glv Aso Thr Phe He Glv Tvr Vai Cvs Lys Cvs Pro 
130 135 ' " 140 

Arg Gly Phe Asn Gly He His Cys Gin His Asn He Asn Glu Cys Glu 
145 150 155 " 160 



\ 
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Val Glu Pro Cys Lvs Asn Gly Glv H e Cvs Thr Ast> Leu Val Ala Asn 

165 170 " 175 



Tvr Ser Cys Glu Cvs Pro Gly Glu Phe Met Glv Arg Asn Cys Gin Tvr 

ISO " 185 ' 190 

Lys Cys Ser Gly Pro Leu Glv lie Glu Gly Gly lie lie Ser Asn Gin 

195 200 * " 205 

Gin lie Thr Ala Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gin Lys 

210 215 " 220 

Trp Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Glv Leu lie Asn Ala 

225 *" 230 235 240 

Trp Thr Ala Ala Glu Asn Asp Arg Trp Lys Arg Trp lie Gin lie Asn 

245 250 " 255 

Leu Gin Arg Lys Met Arg Val Thr Gly Val He Thr Gin Gly Ala Lvs 

260 265 270 

Arg He Gly Ser Pro Glu Tyr He Lys Phe Tvr Lvs He Ala Tvr Ser 

27 5 280 ' 2 85 

Asn Asn Gly Lys Thr Trp Ala Met: Tyr Lys Val Lvs Gly Thr Asn Glu 

290 295 300 

Asp Met Val Phe Arg Gly Asn He Asp Asn Asn Thr Pro Tyr Ala Asn 

305 310 315 " 320 

Ser Phe Thr Pro Pro He Lys Ala Gin Tyr Val Arg Leu Tyr Pro Gin 

325 330 " " 335 

Val Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu 

340 " 345 350 

Leu Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His He Gin 

3 55 3 60 ' ' 3 65 

Asp Tyr Gin He Thr Ala Ser Ser He Phe Arg Thr Leu Asn Met Aso 

370 375 ' 330 

Met Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lys 

385 390 " 398 " 400 

Val Asn Ala Trp Thr Ser Gly His Asn Asp Gin Ser Gin Trp Leu Gin 

405 410 ~ 415 

Val Asp Leu Leu Val Pro Thr Lys Val Thr Gly lie He Thr Gin Glv 

420 " 42 5 ' 430 

Ala Lys Asp Phe Gly His Val Gin Phe Val Gly Ser Tyr Lys Leu Ala 

43 5 440 445 

Tyr Ser Asn Asp Gly Glu His Trp Thr Val Tyr Gin Asu Glu Lys Gin 

450 45 5 ~ ' 450 

Arg Lys Asp Lys Val Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg 

465 ' " 470 475 430 

Lys Asn Val He Asp Pro Pro lie Tyr Ala Arg His He Arg lie Leu 

485 490 495 



1 

T 



Pro Trp Ser Trp Tyr Gly Arg lie Thr Leu Ala Ser Glu Leu Leu Gly 

500 505 510 

Cys Thr Glu Glu Glu 
515 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser lie 
1 5 10 15 

Ser Met Glu * Thr Val * Asn Leu 

20 25 



H ■ (2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

Glu Thr Glu Tro Phe Phe Phe Phe Ser 
I " 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Lys Ser Gly Gin lie Met Val Gly Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: IS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Arg Cys Phe Tyr 
1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GACAGATGGC CATGGATTCA GATAAATTTG CAAAGAAAAA TGAGAGTCAC TGGTGTTATT 60 

ACCCAAGGAG CAAAAAGGAT TGGAAGCCCA GAGTACATAA AATCCTACAA AATTGCCTAC 120 

AG CAATG ACG GGAAGACCTG GGCAATGTAC AAAGTAAAAG GCACCAATGA AGAGATGGTC 180 

TTTCGTGGAA ATGTTGATAA CAACACACCA TATGCTAATT CTTTCACACC CCCAATCAAA 240 

U GCTCAGTATG TAAGACTCTA CCCCCAAATT TGTCGAAGGC ATTGTACTTT AAGAATGGAA 300 

!L CTTCTTGGCT GTGAGCTC 318 

H (2) INFORMATION FOR SEQ ID NO: 20: 

sS (i) SEQUENCE CHARACTERISTICS : 

U l (A) LENGTH: 320 amino acids 

(3) TYPE: amino acid 
?J3 (C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



: : 



T : 
! S : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Cys Ser Thr Gin Leu Gly Met Glu Gly Gly Ala lie Ala Asa Ser Gin 
1 5 10 15 

lie Ser Ala Ser Tyr Val Tyr Met Gly Phe Met Glv Leu Gin Arg Tro 

20 25' " 30 

Gly Pro Glu Leu Ala Arg Leu Tyr Arg Thr Gly He Val Asn Ala Trp 



35 40 



45 



His Ala Ser Asn Tyr Asp Xaa Ser Lys Pro Trp He Gin Val Asn Leu 
50 55 50 

Leu Arg Lys Met Arg Val Ser Gly Val Met Thr Gin Gly Ala Ser Arg 
55 70 75 ' 30" 

Ala Gly Arg Ala Glu Tyr Leu Lys Thr Phe Lys Val Ala Tyr Ser Leu 

85 ' 90 * 95 



r 
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Astd Gly Xaa Arg Lys Phe Glu Phe lie Gin Aso Glu Ser Gly Gly Asp 

100 105 * 110 

Lys Glu Phe Leu Gly Asn Leu Aso Asn Asn Ser Leu Lys Val Asn Met 
115 120 125 

Phe Asn Pro Thr Leu Glu Ala Gin Tyr lie Arg Leu Tyr Pro Val Ser 
130 135 140 

Cys His Arg Gly Cys Thr Leu Arg Phe Glu Leu Leu Gly Cys Glu Leu 
145 ~ " ' 150 " 155 ' 160 

His Gly Cys Leu Glu Pro Leu Glv Leu Lys Asn Asn Thr lie Pro Asp 

165 170 175 

Ser Gin Met Ser Ala Ser Ser Ser Tyr Lys Thr Trt> Asn Leu Arg Ala 

180 135 ~ 190 

Phe Gly Trp Tyr Pro His Leu Glv Arg Leu Asn Asn Gin Glv Lys lie 
195 "* 200 ~ 205 

Asn Ala Trp Thr Ala Gin Ser Asn Ser Ala Lys Glu Trr> Leu Gin Val 
210 " 215 " 220 

Asp Leu Gly Thr Gin Arg Gin Val Thr Glv lie lie Thr Gin Gly Ala 
225 230 " 235 240 

Arg Asp Phe Gly His He Gin Tyr Val Glu Ser Tyr Lys Val Ala His 

245 250 " " 255 

Ser Asp Asp Gly Val Gin Trp Thr Val Tvr Xaa Xaa Glu Glu Gin Glv 

260 265 " 270 



Ul Ser Ser Lys Val Phe Gin Glv Asn Leu Aso Asn Asn Ser His Lvs Lys 

275 ~ 280 235 



Asn He Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val Leu Pro 

290 295 " - 300 

Val Ser Trp His Asn Arg He Thr Leu Arg Leu Glu Leu Leu Gly Cvs 
305 310 ' 315 " 320 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

* Cys Ser Gly Pro Leu Gly He Glu Gly Glv He He Ser Asn Gin Gin 
1 5 " 10~ 15 

He Thr Ala Ser Ser Thr His Arg Ala Leu Phe Glv Leu Gin Lys Tro 

20 "25 " 30 
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Tyr Pro Tyr Tyr 
3 5 

Thr Ala Ala Glu 
50 

Gin Arg Lys Met: 
65 

lie Gly Ser Pro 



Asp Gly Lys Thr 

100 

Met Val Phe Arg 
115 

Phe Thr Pro Pro 
130 

Cys Arg Arg His 
145 

Ser Gly Cys Ser 



Tyr Gin lie Thr 

180 

Phe Thr Trp Glu 
195 

Asn Ala Trp Thr 
210 

Xaa Leu Leu Val 
225 

Lys Aso Xaa Glv 



Ser Asn Asp Glv 

2 60 

Lys Asp Lys Val 
27 5 

Asn Val He Aso 
290 

Trp Ser Tro Tvr 
305 



Ala Arg Leu Asn 

40 

Asn Asp Arg Trp 
55 

Arg Val Thr Gly 
70 

Glu Tyr lie Lvs 
85 

Trp Ala Met Tyr 



Gly Asn lie Aso 

120 

lie Lys Ala Gin 
13 5 

Cys Thr Leu Arg 
150 

Glu Pro Leu Gly 
165 

Ala Ser Ser lie 



Pro Arg Lvs Ala 

2 00 

Ser Gly His Asn 
215 

Pro Thr Lvs Val 
230 

His Val Gin Phe 
245 

Glu His Tro Thr 



Xaa Gin Gly Asn 

2 80 

Pro Pro lie Tvr 
295 

Gly Arg lie Thr 
310 



Lys Lys Gly Leu 



Asn Arg Tro lie 

60 

Val He Thr Gin 
75 

Phe Tyr Lys He 
90 

Lys Val Lys Gly 
105 

Asn Asn Thr Pro 



Tyr Val Arg Leu 

140 

Met Glu Leu Leu 
155 

Met Lys Ser Glv 
17 0 

Phe Arg Thr Leu 
185 

Arg Leu Asp Lys 



Asp Gin Ser Gin 

220 

Thr Gly He He 
235 

Val Gly Ser Tvr 
250 

Val Xaa Gin Asr> 
265 

Phe Asp Asn Asp 



Ala Arg His He 

300 

Leu Ala Ser Glu 
315 



He Asn Ala Trp 
Gin He Asn Leu 



Gly Ala Lys Arg 

80 

Ala Tvr Ser Asn 
95 

Thr Asn Glu Aso 
110 

Tyr Ala Asn Ser 
125 

Tyr Pro Gin Val 



Gly Cys Glu Leu 

160 

His lie Gin Aso 
175 

Asn Met Asp Met: 
190 

Gin Gly Lys Val 
205 

Trp Leu Gin Val 



Thr Gin Glv Ala 

240 

Lys Leu Ala Tyr 
25 5 

Glu Lys Gin Arg 
270 

Thr His Arg Lys 
285 

Arg He Leu Pro 



Leu Leu Glv Cys 

320 



INFORMATION FOR SEQ ID NO: 22: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 amino acids 
(3) TYPE: amino acid 
(C) STRANDED NESS : 



J 

1 



t 



(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPS: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Lys Arg Ser Val Ala Val Tiro Leu Leu Val Glv Leu Ser Leu Gly 
1 5 10 ' 15 

Val Pro Gin Phe Gly Lys Gly Asp lie 

20 25 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 
(3) TYPE: amino acid 
;1 ( C ) STRANDEDNESS : 

l ^ (D) TOPOLOGY: unknown 

y 

4 (ii) MOLECULE TYPE: peptide 



V * 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly lie Cys Leu Pro Gly Leu 
1 " 5 10 15 

Ala Val Gly Xaa Xaa Xaa Xaa Xaa Ser Phe Ser Cys Glu Cys Pro Asr> 

20 25 30 



iU Glv Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser Aso 

35 ~ 40 45 

Glu Glu Glu Pro Thr Ser Ala Glv Pro 
50 55 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPS: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cvs Glu lie Ser Glu 
" 1 5 10 15 

Ala Tyr Arg Gly Asp Thr Phe He Gly Tvr Vai Cys Lvs Cvs Pro Arg 

20' *" 2 5^ ' 30 
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Gly Phe Asn Gly He His Cys Gin His Asn lie 
35 40 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Cys Glu Val Glu Pro Cys Lys Asn Gly Glv He Cvs Thr Ast> Leu Val 
1 5 10 ~ 15 

Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Tvr Ser Cvs Glu Cvs ^ro Glv 

20 25 " - 30 * 

Glu Phe Met Gly Arg Asn Cys Gin Tvr Lys 
35 40 

(2) INFORMATION FOR SEQ ID NO: 26: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: oeotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Cys Xaa Xaa Xaa Pro Cys Xaa Asn Gly Gly Xaa Cys Xaa Xaa Xaa Xaa 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tvr Xaa Cvs Xaa Cvs Xaa Xaa 

20 25 " 36 

Gly Tyr Xaa Gly Xaa Xaa Cys Xaa 
35 40 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base oairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 



-90- 



I 

> 



! 

t 



(A) NAME /KEY : CDS 

(B) LOCATION: 1..309 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 7 : 

NGTGATATTT GTGATCCCAA TCCATGTGAA AATGGAGGTA TCTGTTTGCC AGGATTGGCT 50 

GTAGGTTCCT TTTCCTGTGA GTGTCCAGAT GGCTTCACAG ACCCCAACTG TTCTAGTGTT 120 

GTGGAGGTTG GTCCCTGCAC TCCTAATCCA TGCCATAATG GAGGAACCTG TGAAATAAGT 180 

GAAGCATACC GAGGGGATAC ATTCATAGGC TATGTTTGTA AATGTCCCCG AGGATTTAAT 240 

GGGATTCACT GTCAGCACAA CATAAATGAA TGCGAAGTTG AGCCTTGCAA AAATGGTGGA 300 

ATATGTACAG 310 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D ) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) > 

( ix ) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 549 -.1211 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

GAATTCCGGG AGGGAGGGTA GGGGGGCGGG CCGCGGGGGG CCAAAGCCAG CTAGGCTCAG 60 

TCTCACACGC GCGCCGCCAC TGTTTGTATA TAGTGCGCTC CTGGCCTCAG GCTCGCTCCC 120 

CTCCAGCTCT CGCTTCATTG TTCTCCAAGT CAGAAGCCCC CGCATCCGCC GCGCAGCAGC 180 

GTGAGCCGTA GTCACTGCTG GCCGCTTCGC CTGCGTGCGC GCACGGAAAT CGGGGAGCCA 240 

GGAACCCAAG GAGCCGCCGT CCGCCCGCTG TGCCTCTGCT AGACCACTCG CAGCCCCAGC 300 

CTCTCTCAAG CGCACCCACC TCCGCGCACC CCAGCTCAGG CGAAGCTGGA GTGAGGGTGA 360 

ATCACCCTTT CTCTAGGGCC ACCACTCTTT TATCGCCCTT CCCAAGATTT GAGAAGCGCT 420 

GCGGGAGGAA AGACGTCCTC TTGATCTCTG ACAGGGCGGG GTTTACTGCT GTCCTGCAGG 480 

CGCGCCTCGC CTACTGTGCC CTCCGCTACG ACCCCGGACC AGCCCAGGTC ACGTCCGTGA 540 

GAAGGGATCA TGAAGCACTT GGTAGCAGCC TGGCTTTTGG TTGGACTCAG CCTCGGGGTG 500 

CCCCAGTTCG GCAAAGGTGA CATTTGCAAC CCGAACCCCT GTGAAAATGG -TGGCATCTGT 550 

CTGTCAGGAC TGGCTGATGA TTCCTTTTCC TGTGAGTGTC CAGAAGGCTT CGCAGGTCCG 720 

AACTGCTCTA GTGTTGTGGA GGTTGCATCA G AT G AA G AAA AGCCTACTTC AGCAGGTCCC 7 80 

TGCATCCCTA ACCCATGCCA TAACGGAGGA ACCTGTGAGA TAAGCGAAGC CTATCGAGGA 840 
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GACACATTCA TAGGCTATGT TTGTAAATGT CCTCGGGGAT TTAATGGGAT TCACTGTCAG 

CACAATATAA ATGAATGTGA AGCTGAGCCT TGCAGAAATG GCGGAATATG TACCGACCTT 

GTTGCTAACT ACTCTTGTGA ATGCCCAGGA GAATTTATGG GACGAAATTG TCAATATAAA 

TGCTCTGGGC ACTTGGGAAT CGAAGGTGGG ATCATATCTA ATCAGCAAAT CACAGCTTCA 

TCTAATCACC GAGCTCTTTT TGGACTCCAG AAGTGGTATC CCTACTATGC TAGACTTAAT 

AAGAAGGGCC TTATAAATGC CTGGACAGCT GCTGAAAATG ACAGATGGCC ATGGATTCAG 

GTAACAGTGG GATGAGACAA ATCCATTTCC CAAATTATCA GAATCATTAT AGAAGTAGGT 

TAGGGAGAAT TGGCTGTGAT TCTTTCTCAT GGTTAAAATG TGATTTAGTT CAGAATTAAC 

ATGGTTGGAA ACTCTAAAAA ATGTGGAAAA CAGGAACATT CTATGTCTGA AAATCTGAAA 

ATAGCATCAA GATGAAAACA TTCTTTAGTC ATAAATATAC TCTTTTAAGT TATAGTAGAG 

AAAAAG AT CT TATCATTTCA TAAGTGGACT TTTGGGATAG CATTGGAAAT G T AAATG AAA 

TAAATACCTA ATTGAAAAAA GTTTATTCTA AAGTGTTAAT ATTTAGCAAC AGATTCAGAG 

ACAAGAAAGT AACAATTCAA TCTGTGTATT TTTTGTGAGA AATAGTTTCC CATGTGCAAA 

TATAAAGTGC G CAT CAT AT C ATGATAATAT CCAACTGTCT GCAGAACTCC CTTT CAT AAA 

TGAGAGAATT TTAATTCATA GTGCCTTATA TCCTCATCAG CCATCTGACT TTACTACAGA 

AGAAAACAAT GAAATGATGC ATTAAGTGCT TTGCTAGAAG AAA CAT CAT A GCAAAGCTGA 

TAGCCCACAT TCTGTGCANN NAAGCTTCCA G AGC ACT CG A G AAAAAG C AG AAATGAGATG 

TTTTATGAAA ACCGAAAAGA TAATCTGATT TCTGTGAAAT ATACTTTTGA TCATGTGGTT 

CTTT AAG ATA GTCACTAACA AGTCATTAGT AG C AG AT AC C AAATGGGAGA AAATTTCCAG 

TATACTGAGG GTCAAGGCAG TCATGCTGAA ACTACATGAG GTCAGGAAAG TTTTGAAATA 

AGGTGATTTT GGAAGGATAC CTTCAACTGG CCTAGATTTT C AAG AAACAG TGTAATCAAC 

AG CC AAA CAT GAGAATCTAG CTAACAGCAT TTAGAAAACC AG AA CT AAG A GTGTTACTGG 

GGAATTGCAT TTAAATCCAG TATGAGAGTT TGCAAATGCC GTATTCTTCT AAGGGGTTTG 

TGCCACATTT TGTTACCATG GAGTCCTCTG TAAGAACTTT ATT AG AT AAA T CAT CTTT A C 

ACTATAATTT GAATAAAAGC CGGAATTC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 480 amino acids 
(3) TYRE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYRE: prorein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met: Lys Arg Ser Vai Ala Val Tr-o Leu Leu Val GIv Leu Ser Leu 
1 5 10 15 
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Val Pro Gin Phe Glv Lys Glv Asr> He Cvs Asd Pro Asn Pro Cys Glu 

20 " 2 5 ~ 30 



Asn Gly Giy lie Cys Leu Pro Giy Leu Ala Vai Gly Ser Phe Ser Cys 
35~ 40 " 45 

Glu Cys Pro Asp Giy Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu 
50 " * 55 " ~ 60 

Vai Ala Ser Asp Glu Glu Glu Pro Thr Ser Ala Glv Pro Cvs Thr Pro 
65 70 75 ' ' 80 

Asn Pro Cys His Asn Gly Gly Thr Cys Glu He Ser Glu Ala Tyr Arg 

85 ' * " 90 95 

Giy Asp Thr Phe lie Giy Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn 

100 * 105 ' " 110 

Giy lie His Cys Gin His Asn He Asn Giu Cys Glu Val Glu Pro Cys 
115 120 125 

Lys Asn Gly Gly lie Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu 
130 *" 135 " 140 

Cys Pro Giy Glu Phe Met Gly Arg Asn Cvs Gin Tvr Lvs Cys Ser Gly 
145 150 155 ~ 160 

Pro Leu Giy lie Glu Gly Gly lie lie Ser Asn Gin Gin He Thr Ala 

165 170 175 

Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gin Lys Trr> Tyr Pro Tyr 

180 185 ~ 190 

Tyr Ala Arg Leu Asn Lys Lys Giy Leu He Asn Ala Tru Thr Ala Ala 
195 200 205 

Giu Asn Asp Arg Trp Lys Arg Tro lie Gin He Asn Leu Gin Arg Lvs 
210 215 " 220 

Met Arg Val Thr Giy Val He Thr Gin Glv Ala Lys Arg He Glv Ser 
225 230 235 240 

Pro Glu Tyr He Lys Phe Tyr Lys He Ala Tyr Ser Asn Asp Gly Lys 

245 - 250 * 255 

Thr Trp Ala Met Tyr Lys Vai Lys Gly Thr Asn Giu Asr> Met Val Phe 

260 265 270 

Arg Gly Asn He Asr> Asn Asn Thr Pro Tvr Ala Asn Ser Phe Thr Pro 
275 260 " 285 

Pro He Lys Ala Gin Tvr Vai Arg Leu Tvr Pro Gin Val Cvs Arg Arg 
2 90 29 5 " 300 

His Cys Thr Leu Arg Met Glu Leu Leu Glv Cvs Giu Leu Ser Glv Cvs 
305 " 310 ' 315 " 320 

Ser Glu Pro Leu Giy Met Lys Ser Giy His He Gin Asd Tyr Gin He 

325 " 330 " 33 5 

Thr Ala Ser Ser lie Phe Arg Thr Leu Asn Met Aso Met Phe Thr Tro 

340 345 " 350 



Glu Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lys Vai Asn Ala Trp 

355 360 365 

Thr Ser Gly His Asn Aso Gin Ser Gin Tro Leu Gin Vai Xaa Leu Leu 

370 " " 375 330 

Vai Pro Thr Lys Vai Thr Gly lie lie Thr Gin Gly Ala Lvs Astd Xaa 

385 390 395 " " 400 

Gly His Vai Gin Phe Vai Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp 

405 410 415 

Gly Glu His Trr> Thr Vai Xaa Gin Asn Glu Lys Gin Arg Lvs Asn Lvs 

420 425 " ~ 430 

Vai Xaa Gin Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Vai lie 

435 " " 440 ~ 44 5 

Asp Pro Pro lie Tyr Ala Arg His lie Arg lie Leu Pro Trp Ser Trp 

450 " 455 460 

Tyr Gly Arg lie Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu 

465 470 475 



Glu 
480 



WHAT IS CLAIMED IS: 



1. An isolated nucleotide nucleic acid molecule 
comprising a nucleotide sequence encoding protein which has 

5 three £GF- like domains and two discoidin I/factor VIII -like 
domains . 

2. An isolated nucleic acid molecule, comprising a 
nucleotide sequence that hybridizes under stringent 

0 conditions to a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO: 19. 

3. An isolated nucleic acid molecule, comprising a 
nucleotide sequence that encodes a polypeptide having the 

5 amino acid sequence of SEQ ID NO: 10 or its complement. 



An isolated nucleic acid molecule, comprising a 
nucleotide sequence that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO: 29 or its complement. 

0 

5. An isolated nucleic acid molecule, comprising a 
nucleotide seauence of SEQ ID NO: 28 or its complement . 

6 . A recombinant DNA vector containing a nucleotide 
5 sequence of Claim 2, 3, 4 or 5 . 

7 . A recombinant DNA vector containing a nucleotide 
seauence that encodes a Del- 1 fusion protein. 

0 8 . The recombinant DNA vector of Claim 6 in which the 

del -I nucleotide sequence is operatively associated with a 
regulatory sequence that controls gene expression in a host 

5 9 . The recombinant DNA vector of Claim 7 in which the 

del -I fusion orotein nucleotide seauence is ooeratively 



associated with a regulatory sequence that controls fusion 
protein gene expression in a host cell . 

10. An engineered host cell chat contains the 
recombinant DNA expression vector of Claim 8 . 

11. An engineered ceil line that contains the 
recombinant DNA expression vector of Claim 8 and expresses 
Del-i . 

12. An engineered cell line that contains the 
recombinant DNA expression vector of Claim 9 and expresses 
Del - 1 fusion protein. 

13 . The engineered cell line of Claim 11 or 12 which 
expresses the Del-i on the surface of the cell. 

14 . The engineered cell line of Claim 11 or 12 that 
expresses the Del-i as a soluble protein or fragments 
thereof . 

15. A method for producing recombinant Del - 1 
comprising : 

(a) culturing a host cell transformed with a 
recombinant DNA expression vector containing a 
nucleotide seauence that encodes a Del-1 protein; 
and 

(b) recovering the Del-i Drocein aene oroduct from th 
cell culture . 

16. A method for producing recombinant Del-1 fusion 
protein, comprising : 

(a) culturing a host cell transformed with a 

recombinant DNA expression vector containinq a 



1. 
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(b) recovering the Del-1 fusion protein from the cell 
culture . 

17. An isolated recombinant Del-l protein which has 

5 three SGF- like domains and two discoidin l/facuor VII I - like 
domains . . 

18. A fusion protein comprising Del-1 linked to a 
heterologous protein or peptide sequence or portions thereof 

10 

19. An oligonucleotide which encodes an antisense 
sequence complementary to the del-1 nucleotide sequence, and 
which inhibits translation of the del-1 gene in a cell. 

15 20. The oligonucleotide of Claim 19 which is 

complementary to a nucleotide sequence encoding the amino 
terminal region of the del -I. 

21. An antibody which immunospecif ically binds to an 
20 epitope of the Del-1. 

22. The antibody of Claim 21 which is of monoclonal 
origin . 

25 23. The antibody of Claim 22 which competitively 

inhibits the binding of a molecule to the Del-1. 

24. The antibody of Claim 22 which is linked to a 
cytotoxic .agent . 

30 

25. The antibody of Claim 22 which is linked to a 
radioisotope . 

26. The antibody of Claim 22 which is anchored on a 
35 solid su"or>ort . 

27 . The antibody of Claim 22 which is linked to biotin 
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28 . A met: hod for screening and identifying antagonises 
of Del - 1 comprising: 

(a) contacting a cell line chat expresses Del - 1 

with a test compound; and 
Co) determining whether the test compound 

inhibits the expression or function of Del- 



29 . The method according to Claim 28 in which the cell 
line is a genetically engineered cell line. 

3 0 . The method according to Claim 2 8 in which the cell 
line endogenous ly expresses Del - 1 . 

31. A method for screening and identifying a binding 
partner of Del - 1 activity comprising: 

(a) contacting Del - 1 protein with a random 
peptide library such that Del - 1 will 
recognize and bind to one or more oeotide 
species within the library ; 

(b) isolating the Del - 1 combination; and 

(c) determining the sequence of the peptide 
isolated in step b. 

32. The method according to Claim 31 in which the Del- 
protein is genetically engineered. 



33 . A method of detecting and isolating embryonic cells 
comprising incubating a cell mixture with an ant i -Del -1 
antibody, and isolating the antibody-bound cells. 



A3 S TRACT 

The presenc invention relates to a member of a novel 
gene family referred no as developmental ly- regulated 
endothelial cell locus * 1 ( del -1) . In particular , the 
invention relates to del -1 nucleotide sequences, Del - 1 amino 
acid sequences , methods of expressing a functional gene 
product, and methods of using the gene and gene product. 
Since del - 1 is expressed in endothelial cells and certain 
cancer cells, it may be useful as an endothelial cell and 
tumor marker. In addition, the ability of Dei - 1 to inhibit 
vascular formation allows its use as an ant i - angiogenic 
agent . 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Quertermous , Thomas 

Hogan, Brigid 
Snodgr as s , H . Ralph 
Zupancic, Thomas J. 

(ii) TITLE OF INVENTION: DEVELOPMENTALLY- REGULATED ENDOTHELIAL 
CELL LOCUS -1 

(iii) NUMBER OF SEQUENCES: 29 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds LLP 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/659,235 

(B) FILING DATE: 05-JUN-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Poissant, Brian M. 

(B) REGISTRATION NUMBER: 2 8,462 

(C) REFERENCE /DOCKET NUMBER: 8 9 07-0034-999 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-8864/9741 

(C) TELEX: 66141 Pennie 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Asp Leu Leu Val Pro Thr Lys Val Thr Gly lie He Thr Gin Gly Xaa 
15 10 is 



Xaa Ala Lys Asp Phe Gly Asp Val Leu Phe Val Gly Ser Tyr Lys Leu 

20 * ' 25 30 

Ala Tyr Ser Asn Asp Gly Glu His Trp Met Val His Gin Asp Glu Lys 
35 40 * 45 

Gin Arg Lys Asp Lys Val Phe Gin Gly Asn Phe Asp Asn Asp Thr His 
50 55 60 

Arg Lys Asn Val lie Asp Pro Pro lie Tyr Ala Arg Phe lie Arg lie 
65 70 75 80 

Leu Pro Leu Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Asp Leu Gly Ser Ser Lys Glu Val Thr Gly lie lie Thr Gin Gly Xaa 
1 5 10 15 

Xaa Ala Arg Asn Phe Gly Ser Val Gin Phe Val Ala Ser Tyr Lys Val 

20 25 30 

Ala Tyr Ser Asn Asp Ser Ala Asn Trp Thr Glu Tyr Gin Asp Pro Arg 
3 5 4 0 45 

Thr Gly Ser Ser Lys Val Phe Gin Gly Asn Leu Asp Asn Asn Ser His 
50 55 60 

Lys Lys Asn lie Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val 
65 70 75 3 " 80 

Leu Pro Val Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
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Asp Leu Leu Lys lie Lys Lys lie Thr Ala lie lie Thr Gin Gly Xaa 
1 5 10 15 

Xaa Cys Lys Ser Leu Ser Ser Glu Met Tyr Val Lys Ser Tyr Thr lie 

20 25 ^ 30 

His Tyr Ser Glu Gin Gly Val Glu Trp Lys Pro Tyr Arg Leu Lys Ser 
3 5 40 45 

Ser Met Val Asp Lys lie Phe Glu Gly Asn Thr Asn Thr Lys Gly His 
50 55 60 

Val Lys Asn Phe Phe Asn Pro Pro lie lie Ser Arg Phe lie Arg Val 
65 70 75 80 

lie Pro Lys Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Asp Leu Gin Lys Thr Met Lys Val Thr Gly lie lie Thr Gin Gly Xaa 
1 5 10 15 

Xaa Val Lys Ser Leu Phe Thr Ser Met Phe Val Lys Glu Phe Leu lie 

20 25 30 

Ser Ser Ser Gin Asp Gly His His Trp Thr Xaa Xaa Gin lie Leu Tyr 
35 40 45 

Asn Gly Lys Val Lys Val Phe Gin Gly Asn Gin Asp Ser Ser Thr Pro 
50 55 60 

Met Met Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Asp Leu Glu Asn Leu Arg Phe Val Ser Gly lie Gly Thr Gin Gly Ala 
1 " 5 10 15 

lie Ser Lys Glu Thr Lys Lys Lys Tyr Phe Val Lys Ser Tyr Lys Val 

20 25 30 

Asp lie Ser Ser Asn Gly Glu Asp Trp lie Xaa Xaa Thr Leu Lys Gly 
35 40 4 5 

Asp Asn Lys His Leu Val Phe Thr Gly Asn Thr Asp Ala Thr Asp Val 
50 ' 55 * 60 

Val Tyr Arg Pro Phe Ser Lys Pro Val lie Thr Arg Phe Val Arg Leu 
65 * 70 ~ 75 ^ 80 

Arg Pro Val Thr Trp 

85 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



i--. 

i : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Asp Leu Ala Glu Glu Lys lie Val Arg Gly Val lie lie Gin Gly Xaa 
15 10 15 

Xaa Gly Lys His Lys Glu Asn Lys Val Phe Met Arg Lys Phe Lys lie 

20 25 30 

Gly Tyr Ser Asn Asn Gly Thr Glu Trp Glu Met lie Met Asp Ser Ser 
35 40 45 

Lys Asn Lys Pro Lys Thr Phe Glu Gly Asn Thr Asn Tyr Asp Thr Pro 
50 55 60 

Glu Leu Arg Thr Phe Xaa Ala His lie Thr Thr Gly Phe lie Arg lie 
65 70 75 80 

lie Pro Xaa Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Gly Cys Glu Val Pro Arg Thr Phe Met Cys Val Ala Leu Gin Gly Xaa 
15 10 15 

Xaa Xaa Arg Gly Xaa Asp Ala Asp Gin Trp Val Thr Ser Tyr Lys lie 

20 25 30 

Arg Tyr Ser Leu Asp Asn Val Ser Trp Phe Xaa Xaa Xaa Xaa Xaa Glu 
35 40 45 

Tyr Arg Asn Gly Ala Ala lie Thr Gly Val Thr Asp Arg Asn Thr Val 
50 55 60 

Val Asn His Phe Phe Asp Thr Pro lie Arg Ala Arg Ser lie Ala* lie 
65 70 ~ 75 80 

His Pro Leu Thr Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asp Leu Xaa Xaa Xaa Xaa Xaa Val Thr Gly lie lie Thr Gin Gly Xaa 
15 10 15 

Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa Phe Val Xaa Ser Tyr Lys lie 

20 25 30 

Xaa Tyr Ser Xaa Asp Gly Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Lys Xaa Lys Val Phe Xaa Gly Asn Thr Asp Xaa Xaa Thr Xaa 
50 55 60 

Xaa Xaa Asn Xaa Phe Xaa Xaa Pro lie Xaa Xaa Arg Phe lie Arg Xaa 
65 70 75 ~ 80 

Xaa Pro Xaa Xaa Xaa 

85 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 03 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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<ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: , 

(A) NAME /KEY : CDS 

(B) LOCATION: 619.. 2058 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GAATTCCGGT TAACTGAGGA CAAAGGGTAA TGCAGAAGTG ATATTTGATT TCCATTCTCA 

TTCCCAGTGG CCTTGATATT TAAACTGATT CCTGCCACCA GGTCCTTGGG CCACCCTGTC 

CCTGCGTCTC ATATTTCTGC ATGCTGCTTT GTTTGTATAT AGTGCGCTCC TGGCCTCAGG 

CTCGCTCCCC TCCAGCTCTC GCTTCATTGT TCTCCAAGTC AGAAGCCCCC GCATCCGCCG 

CGCAGCAGCG TGAGCCGTAG TCACTGCTGG CCGCTTCGCC TGCGTGCGCG CACGGAAATC 

GGGGAGCCAG GAACCCAAGG AGCCGCCGTC CGCCCGCTGT GCCTCTGCTA GACCACTCGC 

AGCCCCAGCC TCTCTCAAGC GCACCCACCT CCGCGCACCC CAGCTCAGGC GAAGCTGGAG 

TGAGGGTGAA TCACCCTTTC TCTAGGGCCA CCACTCTTTT ATCGCCCTTC CCAAGATTTG 

AGAAGCGCTG CGGGAGGAAA GACGTCCTCT TGATCTCTGA CAGGGCGGGG TTTACTGCTG 

TCCTGCAGGC GCGCCTCGCC TACTGTGCCC TCCGCTACGA CCCCGGACCA GCCCAGGTCA 

CGTCCGTGAG AAGGGATC ATG AAG CAC TTG GTA GCA GCC TGG CTT TTG ,GTT 

Met Lys His Leu Val Ala Ala Trp Leu Leu Val 
1 5 10 

GGA CTC AGC CTC GGG GTG CCC CAG TTC GGC AAA GGT GAC ATT TGC AAC 
Gly Leu Ser Leu Gly Val Pro Gin Phe Gly Lys Gly Asp lie Cys Asn 

15 20 25 

CCG AAC CCC TGT GAA AAT GGT GGC ATC TGT CTG TCA GGA CTG GCT GAT 
Pro Asn Pro Cys Glu Asn Gly Gly lie Cys Leu Ser Gly Leu Ala Asp 
30 35 ~ 40 

GAT TCC TTT TCC TGT GAG TGT CCA GAA GGC TTC GCA GGT CCG AAC TGC 
Asp Ser Phe Ser Cys Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys 
45 50 55 

TCT AGT GTT GTG GAG GTT GCA TCA GAT GAA GAA AAG CCT ACT TCA GCA 
Ser Ser Val Val Glu Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala 
60 65 70 * 75 

GGT CCC TGC ATC CCT AAC CCA TGC CAT AAC GGA GGA ACC TGT GAG ATA 
Gly Pro Cys lie Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu lie 

80 85 * 90 

AGC GAA GCC TAT CGA GGA GAC ACA TTC ATA GGC TAT GTT TGT AAA TGT 
Ser Glu Ala Tyr Arg Gly Asp Thr Phe lie Gly Tyr Val Cys Lys Cys 

95 100 * 105 

CCT CGG GGA TTT AAT GGG ATT CAC TGT CAG CAC AAT ATA AAT GAA TGT 
Pro Arg Gly Phe Asn Gly lie His Cys Gin His Asn lie Asn Glu Cys 
110 115 12 0 

GAA GCT GAG CCT TGC AGA AAT GGC GGA ATA TGT ACC GAC CTT GTT GCT 



Glu Ala Glu Pro Cys Arg Asn Gly Gly lie Cys Thr Asp Leu Val Ala 
125 ~ 130 ' 135 



AAC TAC TCT TGT GAA TGC CCA GGA GAA TTT ATG GGA CGA AAT TGT CAA 
Asn Tyr Ser Cys Glu Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gin 
140 " 145 150 155 

TAT AAA TGC TCT GGG CAC TTG GGA ATC GAA GGT GGG ATC ATA TCT AAT 
Tyr Lys Cys Ser Gly His Leu Gly lie Glu Gly Gly lie lie Ser Asn 

16 0 ^ 165 ~ " 170 

CAG CAA ATC ACA GCT TCA TCT AAT CAC CGA GCT CTT TTT GGA CTC CAG 
Gin Gin lie Thr Ala Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gin 

175 180 185 

AAG TGG TAT CCC TAC TAT GCT CGA CTT AAT AAG AAG GGC CTT ATA AAT 
Lys Trp Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu lie Asn 
190 ~ 195 ~ 200 

GCC TGG ACA GCT GCT GAA AAT GAC AGA TGG CCA TGG ATT CAG ATA AAT 
Ala Trp Thr Ala Ala Glu Asn Asp Arg Trp Pro Trp lie Gin lie Asn 
205 210 215 

TTG CAA AGA AAA ATG AGA GTC ACT GGT GTT ATT ACC CAA GGA GCA AAA 
Leu Gin Arg Lys Met Arg Val Thr Gly Val lie Thr Gin Gly Ala Lys 
220 225 230 235 

AGG ATT GGA AGC CCA GAG TAC ATA AAA TCC TAC AAA ATT GCC TAC AGC 
Arg lie Gly Ser Pro Glu Tyr lie Lys Ser Tyr Lys lie Ala Tyr Ser 

240 245 250 

AAT GAC GGG AAG ACC TGG GCA ATG TAC AAA GTA AAA GGC ACC AAT GAA 
Asn Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu 

255 " 260 265 

GAG ATG GTC TTT CGT GGA AAT GTT GAT AAC AAC ACA CCA TAT GCT AAT 
Glu Met Val Phe Arg Gly Asn Val Asp Asn Asn Thr Pro Tyr Ala Asn 
270 275 280 

TCT TTC ACA CCC CCA ATC AAA GCT CAG TAT GTA AGA CTC TAC CCC CAA 
Ser Phe Thr Pro Pro lie Lys Ala Gin Tyr Val Arg Leu Tyr Pro Gin 
285 290 295 

ATT TGT CGA AGG CAT TGT ACT TTA AGA ATG GAA CTT CTT GGC TGT GAG 
lie Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu 
300 " ~ 305 310 315 

CTC TCA GGC TGT TCA GAA CCT TTG GGG ATG AAA TCA GGG CAT ATA CAA 
Leu Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His lie Gin 

320 325 330 

GAC TAC CAG ATC ACT GCC TCC AGC GTC TTC AGA ACA CTC AAC ATG GAC 
Asp Tyr Gin lie Thr Ala Ser Ser Val Phe Arg Thr Leu Asn Met Asp 

335 340 " 345 

ATG TTT ACT TGG GAA CCA AGG AAA GCC AGG CTG GAC AAG CAA GGC AAA 
Met Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lys 
350 355 " 360 

GTA AAT GCC TGG ACT TCC GGC CAT AAC GAC CAG TCA CAA TGG TTA CAG 
Val Asn Ala Trp Thr Ser Gly His Asn Asp Gin Ser Gin Trp Leu Gin 
365 370 375 



GTT GAT CTT CTT GTC CCT ACT AAG GTG ACA GGC ATC ATT ACA CAA GGA 1803 
Val Asp Leu Leu Val Pro Thr Lys Val Thr Gly lie lie Thr Gin Gly 
380 385 390 395 

GCT AAA GAT TTT GGT CAC GTG CAG TTT GTT GGG TbA TAC AAA CTA GCT 1851 
Ala Lys Asp Phe Gly His Val Gin Phe Val Gly Ser Tyr Lys Leu Ala 

4 00 405 " 410 

TAC AGC AAT GAT GGA GAA CAC TGG ATG GTG CAC CAG GAT GAA AAA CAG 1899 
Tyr Ser Asn Asp Gly Glu His Trp Met Val His Gin Asp Glu Lys Gin 

415 420 425 

AGG AAA GAC AAG GTT TTT CAA GGC AAT TTT GAC AAT GAC ACT CAC AGG 1947 
Arg Lys Asp Lys Val Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg 
430 435 440 

AAA AAT GTC ATC GAC CCT CCC ATC TAT GCA CGA TTC ATA AGA ATC CTT " 1995 
Lys Asn Val lie Asp Pro Pro lie Tyr Ala Arg Phe lie Arg lie Leu 
445 450 455 





CCT 




Pro 


I '::r. 


460 




TGC 




Cys 



465 470 475 



2043 



2098 



480 

TTTCCTATAA GTATCTCCAC GAAATGAACT GTGTGAAGCT GATGGAAACT GCATTTGTTT 215 8 

TTTTCAAAGT GTTCAAATTA TGGTAGGCTA CTGACTGTCT TTTTAGGAGT TCTAAGCTTG 2218 

CCTTTTTAAT AATTTAATTT GGTTTCCTTT GCTCAACTCT CTTATGTAAT ATCACACTGT 22 78 

CTGTGAGTTA CTCTTCTTGT TCTCT 2 3 03 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Met Lys His Leu Val Ala Ala Trp Leu Leu Val Gly Leu Ser Leu Gly 
l 5 10 15 

Val Pro Gin Phe Gly Lys Gly Asp lie Cys Asn Pro Asn Pro Cys Glu 

20 25 30 

Asn Gly Gly lie Cys Leu Ser Gly Leu Ala Asp Asp Ser Phe Ser Cys 
3 5 4 0 4 5 

Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys Ser Ser Val Val Glu 
50 55 60 

Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala Gly Pro Cys lie Pro 
65 70 75 ~ 80 
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t 



Asn Pro 



Gly Asp 



Gly lie 



Arg Asn 
130 

Cys Pro 
145 

His Leu 



Ser Ser 

;:: i:v 

r; Tyr Ala 



Glu Asn 
210 

Arg Val 

225 

Glu Tyr 



y Trp Ala 

^ Gly Asn 



lie Lys 
290 

Cys Thr 
305 

Glu Pro 



Ala Ser 



Pro Arg 



Ser Gly 
370 

Pro Thr 
385 

His Val 



Cys His Asn Gly 
85 

Thr Phe lie Gly 
100 

His Cys Gin His 
115 

Gly Gly lie Cys 



Gly Glu Phe Met 

150 

Gly lie Glu Gly 
165 

Asn His Arg Ala 
180 

Arg Leu Asn Lys 
195 

Asp Arg Trp Pro 



Thr Gly Val He 

230 

He Lys Ser Tyr 
. 245 

Met Tyr Lys Val 
26 0 

Val Asp Asn Asn 
275 

Ala Gin Tyr Val 



Leu Arg Met Glu 

310 

Leu Gly Met Lys 
32 5 

Ser Val Phe Arg 
340 

Lys Ala Arg Leu 
355 

His Asn Asp Gin 



Lys Val Thr Gly 

390 

Gin Phe Val Gly 
405 



Gly Thr Cys 



Tyr Val Cys 
105 

Asn lie Asn 
120 

Thr Asp Leu 
135 

Gly Arg Asn 



Gly He He 



Leu Phe Gly 
185 

Lys Gly Leu 
200 

Trp He Gin 
215 

Thr Gin Gly 



Lys He Ala 



Lys Gly Thr 
265 

Thr Pro Tyr 
280 

Arg Leu Tyr 
295 

Leu Leu Gly 



Ser Gly His 



Thr Leu Asn 
345 

Asp Lys Gin 
360 

Ser Gin Trp 
375 

lie He Thr 



Ser Tyr Lys 



Glu He Ser Glu 
90 

Lys Cys Pro Arg 



Glu Cys Glu Ala 

125 

Val Ala Asn Tyr 
140 

Cys Gin Tyr Lys 
155 

Ser Asn Gin Gin 
170 

Leu Gin Lys Trp 



He Asn Ala Trp 

205 

He Asn Leu Gin 
220 

Ala Lys Arg He 
235 

Tyr Ser Asn Asp 
250 

Asn Glu Glu Met 



Ala Asn Ser Phe 

285 

Pro Gin He Cys 
300 

Cys Glu Leu Ser 
315 

He Gin Asp Tyr 
330 

Met Asp Met Phe 



Gly Lys Val Asn 

365 

Leu Gin Val Asp 
380 

Gin Gly Ala Lys 
3 95 

Leu Ala Tyr Ser 
410 



Ala Tyr Arg 
95 

Gly Phe Asn 
110 

Glu Pro Cys 



Ser Cys Glu 



Cys Ser Gly 
160 

He Thr Ala 
175 

Tyr Pro Tyr 
190 

Thr Ala Ala 



Arg Lys Met 



Gly Ser Pro 
240 

Gly Lys Thr 
255 " 

Val Phe Arg 
270 

Thr Pro Pro 



Arg Arg His 



Gly Cys Ser 
320 

Gin He Thr 
335 

Thr Trp Glu 
3 50 

Ala Trp Thr 



Leu Leu Val 



Asp Phe Gly 
400 

Asn Asp Gly 
415 
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Glu His Trp Met Val His Gin Asp Glu Lys Gin Arg Lys Asp Lys Val 

420 425 430 

Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie Asp 

435 440 * 445 

Pro Pro lie Tyr Ala Arg Phe lie Arg lie Leu Pro Trp Ser Trp Tyr 

450 455 ^ 460 

Gly Arg lie Thr Leu Arg Ser Glu Leu Leu Gly Cys Ala Glu Glu Glu 

465 470 475 480 



(2) INFORMATION FOR SEQ ID NO : 11 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1779 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 : 

TCT CTT TAG TCA CCA CTC TCG CCC TCT CCA AGA ATT TGT TTA AC A *AAG 
Ser Leu * Ser Pro Leu Ser Pro Ser Pro Arg lie Cys Leu Thr Lys 
15 10 15 

CGC TGA GGA AAG AGA ACG TCT TCT TGA ATT CTT TAG TAG GGG CGG AGT 
Arg * Gly Lys Arg Thr Ser Ser * lie Leu * * Gly Arg Ser 

20 25 30 

CTG CTG CTG CCC TGC GCT GCC ACC TCG GCT ACA CTG CCC TCC GCG ACG 
Leu Leu Leu Pro Cys Ala Ala Thr Ser Ala Thr Leu Pro Ser Ala Thr 

35 40 45 

ACC CCT GAC CAG CCG GGG TCA CGT CCG GGA GAC GGG ATC ATG AAG CGC 
Thr Pro Asp Gin Pro Gly Ser Arg Pro Gly Asp Gly lie Met Lys Arg 
50 55 60 

TCG GTA GCC GTC TGG CTC TTG GTC GGG CTC AGC CTC GGT GTC CCC CAG 
Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly Val Pro Gin 
65 70 75 80 

TTC GGC AAA GGT GAT ATT TGT GAT CCC AAT CCA TGT GAA AAT GGA GGT 
Phe Gly Lys Gly Asp lie Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly 

85 90 95 

ATC TGT TTG CCA GGA TTG GCT GTA GGT TCC TTT TCC TGT GAG TGT CCA 
lie Cys Leu Pro Gly Leu Ala Val Gly Ser Phe Ser Cys Glu Cys Pro 

100 105 ' 110 

GAT GGC TTC ACA GAC CCC AAC TGT TCT AGT GTT GTG GAG GTT GGA TCA 
Asp Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser 
115 120 125 
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GAT GAA GAA GAA CCA ACT TCA GCA GGT CCC TGC ACT CCT AAT CCA TGC 
Asp Glu Glu Glu Pro Thr Ser Ala Gly Pro Cys Thr Pro Asn Pro Cys 
130 135 140 

CAT AAT GGA GGA ACC TGT GAA ATA AGT GAA GCA 'f'AC CGA GGG GAT ACA 
His Asn Gly Gly Thr Cys Glu lie Ser Glu Ala Tyr Arg Gly Asp Thr 
145 " 150 155 160 

TTC ATA GGC TAT GTT TGT AAA TGT CCC CGA GGA TTT AAT GGG ATT CAC 
Phe lie Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn Gly lie His 

165 170 175 

TGT CAG CAC AAC ATA AAT GAA TGC GAA GTT GAG CCT TGC AAA AAT GGT 
Cys Gin His Asn lie Asn Glu Cys Glu Val Glu Pro Cys Lys Asn Gly 

180 185 190 

GGA ATA TGT ACA GAT CTT GTT GCT AAC TAT TCC TGT GAG TGC CCA GGC 
Gly lie Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu Cys Pro Gly 
195 200 205 

GAA TTT ATG GGA AGA AAT TGT CAA TAC AAA TGC TCA GGC CCA CTG GGA 
Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys Cys Ser Gly Pro Leu Gly 
210 * 215 220 

ATT GAA GGT GGA ATT ATA TCA AAC CAG CAA ATC ACA GCT TCC TCT ACT 
lie Glu Gly Gly lie lie Ser Asn Gin Gin lie Thr Ala Ser Ser Thr 
225 230 235 240 

CAC CGA GCT CTT TTT GGA CTC CAA AAA TGG TAT CCC TAC TAT GCA CGT 
His Arg Ala Leu Phe Gly Leu Gin Lys Trp Tyr Pro Tyr Tyr Ala Arg 

245 250 255. 

CTT AAT AAG AAG GGG CTT ATA AAT GCG TGG ACA GCT GCA GAA AAT GAC 
Leu Asn Lys Lys Gly Leu lie Asn Ala Trp Thr Ala Ala Glu Asn Asp 

260 265 270 

AGA TGG AAG CGG TGG ATT CAG ATA AAT TTG CAA AGA AAA ATG AGA GTT 
Arg Trp Lys Arg Trp lie Gin He Asn Leu Gin Arg Lys Met Arg Val 
275 ~ ~ 280 285 

ACT GGT GTG ATT ACC CAA GGG GCC AAG AGG ATT GGA AGC CCA GAG TAT 
Thr Gly Val He Thr Gin Gly Ala Lys Arg He Gly Ser Pro Glu Tyr 
290 295 * 300 

ATA AAA TTC TAC AAA ATT GCC TAC AGT AAT GAT GGA AAG ACT TGG GCA 
He Lys Phe Tyr Lys He Ala Tyr Ser Asn Asp Gly Lys Thr Trp Ala 
3 05 ~ 310 315 32 0 

ATG TAC AAA GTG AAA GGC ACC AAT GAA GAC ATG GTG TTT CGT GGA AAC 
Met Tyr Lys Val Lys Gly Thr Asn Glu Asp Met Val Phe Arg Gly Asn 

325 330 335 

ATT GAT AAC AAC ACT CCA TAT GCT AAC TCT TTC ACA CCC CCC ATA AAA 
He Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro Pro He Lys 

340 * 345 350 

GCT CAG TAT GTA AGA CTC TAT CCC CAA GTT TGT CGA AGA CAT TGC ACT 
Ala Gin Tyr Val Arg Leu Tyr Pro Gin Val Cys Arg Arg His Cys Thr 
355 ~ "* 360 " 365 

TTG CGA ATG GAA CTT CTT GGC TGT GAA CTG TCG GGT TGT TCT GAG CCT 
Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys Ser Glu Pro 
370 375 380 
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CTG GGT ATG AAA TCA GGA CAT ATA CAA GAC TAT CAG ATC ACT GCC TCC 
Leu Gly Met Lys Ser Gly His lie Gin Asp Tyr Gin lie Thr Ala Ser 
385 390 395 400 

AGC ATC TTC AGA ACG CTC AAC ATG GAC ATG TTC AbT TGG GAA CCA AGG 
Ser lie Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp Glu Pro Arg 

405 410 415 

AAA GCT CGG CTG GAC AAG CAA GGC AAA GTG AAT GCC TGG ACC TCT GGC 
Lys Ala Arg Leu Asp Lys Gin Gly Lys Val Asn Ala Trp Thr Ser Gly 

420 425 430 

CAC AAT GAC CAG TCA CAA TGG TTA CAG GTG GAT CTT CTT GTT CCA ACC 
His Asn Asp Gin Ser Gin Trp Leu Gin Val Asp Leu Leu Val Pro Thr 
435 440 445 

AAA GTG ACT GGC ATC ATT ACA CAA GGA GCT AAA GAT TTT GGT CAT GTA 
Lys Val Thr Gly lie lie Thr Gin Gly Ala Lys Asp Phe Gly His Val 
450 455 460 

CAG TTT GTT GGC TCC TAC AAA CTG GCT TAC AGC AAT GAT GGA GAA CAC 
Gin Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp Gly Glu His 
465 " 470 475 "* 480 

TGG ACT GTA TAC CAG GAT GAA AAG CAA AGA AAA GAT AAG GTT TTC CAG 
Trp Thr Val Tyr Gin Asp Glu Lys Gin Arg Lys Asp Lys Val Phe Gin 

485 490 495 

GGA AAT TTT GAC AAT GAC ACT CAC AGA AAA AAT GTC ATC GAC CCT CCC 
Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie Asp Pro Pro 

500 505 510 

ATC TAT GCA CGA CAC ATA AGA ATC CTT CCT TGG TCC TGG TAC GGG AGG 
lie Tyr Ala Arg His lie Arg lie Leu Pro Trp Ser Trp Tyr Gly Arg 
515 ~ 520 525 

ATC ACA TTG GCG TCA GAG CTG CTG GGC TGC ACA GAG GAG GAA TGA GGG 
lie Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu Glu * Gly 
530 535 540 

GAG GCT ACA TTT CAC AAC CGT CTT CCC TAT TTG GGT AAA AGT ATC TCC 
Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser lie Ser 
545 550 555 ~ 560 

ATG GAA TGA ACT GTG TAA AAT CTG TAG GAA ACT GAA TGG TTT TTT TTT 
Met Glu * Thr Val * Asn Leu * Glu Thr Glu Trp Phe Phe Phe 

565 570 575 

TTT TCA TGA AAA AGT GGT CAA ATT ATG GTA GGC AAC TAA CGG TGT TTT 
Phe Ser * Lys Ser Gly Gin lie Met Val Gly Asn * Arg Cys Phe 

580 585 590 

TAC C 
Tyr 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser Pro Leu Ser Pro Ser Pro Arg lie Cys* Leu Thr Lys Arg 
1 5 10 * 14 

INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Gly Lys Arg Thr Ser Ser 
1 5 

INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRI3 

Gly Arg Ser Leu Leu 
1 5 

Ser Ala Thr Thr Pro 

20 

Met Lys Arg Ser Val 
35 

Val Pro Gin Phe Gly 
50 

Asn Gly Gly lie Cys 
65 

Glu Cys Pro Asp Gly 

85 

Val Ala Ser Asp Glu 

100 

Asn Pro Cys His Asn 
115 

Gly Asp Thr Phe lie 
130 



TION: SEQ ID NO : 14 : 

Leu Pro Cys Ala Ala Thr 

10 

Asp Gin Pro Gly Ser Arg 

2 5 

Ala Val Trp Leu Leu Val 
40 

Lys Gly Asp lie Cys Asp 
55 

Leu Pro Gly Leu Ala Val 
70 75 

Phe Thr Asp Pro Asn Cys 

90 

Glu Glu Pro Thr Ser Ala 

105 

Gly Gly Thr Cys Glu He 
120 

Gly Tyr Val Cys Lys Cys 
135 



Ser Ala Thr Leu Pro 

15 

Pro Gly Asp Gly He 
30 

Gly Leu Ser Leu Gly 
45 

Pro Asn Pro Cys Glu 
60 

Gly Ser Phe Ser Cys 

80 

Ser Ser Val Val Glu 

95 

Gly Pro Cys Thr Pro 
110 

Ser Glu Ala Tyr Arg 
125 

Pro Arg Gly Phe Asn 
140 
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Gly lie His Cys Gin His Asn lie Asn Glu Cys Glu Val Glu Pro Cys 
145 150 155 160 

Lys Asn Gly Gly lie Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu 

165 170 1 175 

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys Cys Ser Gly 

180 185 190 

Pro Leu Gly lie Glu Gly Gly lie lie Ser Asn Gin Gin lie Thr Ala 
195 200 205 

Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gin Lys Trp Tyr Pro Tyr 
210 215 22 0 

Tyr Ala Arg Leu Ash Lys Lys Gly Leu lie Asn Ala Trp Thr Ala Ala 
225 230 235 240 

Glu Asn Asp Arg Trp Lys Arg Trp lie Gin lie Asn Leu Gin Arg Lys 

245 250 255 

Met Arg Val Thr Gly Val lie Thr Gin Gly Ala Lys Arg lie Gly Ser 

260 265 270 

Pro Glu Tyr lie Lys Phe Tyr Lys lie Ala Tyr Ser Asn Asp Gly Lys 
275 280 285 

Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Asp Met Val Phe 
290 295 300 

Arg Gly Asn lie Asp Asn Asn Thr Pro Tyr Ala Asn Ser Phe Thr Pro 
305 310 " 315 * 320 

Pro lie Lys Ala Gin Tyr Val Arg Leu Tyr Pro Gin Val Cys Arg Arg 

325 330 335 

His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu Ser Gly Cys 

340 345 ' ' 3 50 

Ser Glu Pro Leu Gly Met Lys Ser Gly His lie Gin Asp Tyr Gin lie 
355 360 365 

Thr Ala Ser Ser lie Phe Arg Thr Leu Asn Met Asp Met Phe Thr Trp 
370 375 380 

Glu Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lys Val Asn Ala Trp 
385 390 395 " 400 

Thr Ser Gly His Asn Asp Gin Ser Gin Trp Leu Gin Val Asp Leu Leu 

405 410 415 

Val Pro Thr Lys Val Thr Gly lie lie Thr Gin Gly Ala Lys Asp Phe 

420 425 430 

Gly His Val Gin Phe Val Gly Ser Tyr Lys Leu Ala Tyr Ser Asn Asp 

435 440 445 

Gly Glu His Trp Thr Val Tyr Gin Asp Glu Lys Gin Arg Lys Asp Lys 
450 455 460 

Val Phe Gin Gly Asn Phe Asp Asn Asp Thr His Arg Lys Asn Val lie 
465 470 475 " " 480 
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Asp Pro Pro lie Tyr Ala Arg His lie Arg lie Leu Pro Trp Ser Trp 

485 490 495 

Tyr Gly Arg lie Thr Leu Ala Ser Glu Leu Leu Gly Cys Thr Glu Glu 

500 505 A 510 

Glu 

(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Gly Glu Ala Thr Phe His Asn Arg Leu Pro Tyr Leu Gly Lys Ser lie 
1 5 10 " 15 

Ser Met Glu 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 

Glu Thr Glu Trp Phe Phe Phe Phe Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Lys Ser Gly Gin lie Met Val Gly Asn 
15 

(2) INFORMATION FOR SEQ ID NO : 18 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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1 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
Arg Cys Phe Tyr * 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
iS8 GACAGATGGC CATGGATTCA GATAAATTTG CAAAGAAAAA TGAGAGTCAC TGGTGTTATT 60 
: T ACCCAAGGAG CAAAAAGGAT TGGAAGCCCA GAGTACATAA AATCCTACAA AATTGCCTAC 120 

i .-..J 
•* 

AGCAATGACG GGAAGAC CTG GGCAATGTAC AAAGTAAAAG GCACCAATGA AGAGATGGTC 18 0 

TTTCGTGGAA ATGTTGATAA CAACACACCA TATGCTAATT CTTTCACACC CCCAATCAAA 240 

H GCTCAGTATG TAAGACTCTA CCCCCAAATT TGTCGAAGGC ATTGTACTTT AAGAATGGAA 3 00 

ri CTTCTTGGCT GTGAGCTC , 318 

(2) INFORMATION FOR SEQ ID NO: 20: 

fj] (i) SEQUENCE CHARACTERISTICS: 

*j] (A) LENGTH: 320 amino acids 

(B) TYPE: amino acid 
m (C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DE£ 

Cys Ser Thr Gin 
1 

lie Ser Ala Ser 

20 

Gly Pro Glu Leu 
35 

His Ala Ser Asn 
50 

Leu Arg Lys Met 
65 

Ala Gly Arg Ala 



CRIPTION: SEQ I! 

Leu Gly Met Glu 
5 

Tyr Val Tyr Met 



Ala Arg Leu Tyr 

40 

Tyr Asp Xaa Ser 
55 

Arg Val Ser Gly 
70 

Glu Tyr Leu Lys 



NO: 20 : 

Gly Gly Ala lie 
10 

Gly Phe Met Gly 
25 

Arg Thr Gly lie 



Lys Pro Trp lie 

60 

Val Met Thr Gin 
75 

Thr Phe Lys Val 



Ala Asp Ser Gin 
15 

Leu Gin Arg Trp 
30 

Val Asn Ala Trp 
45 

Gin Val Asn Leu 



Gly Ala Ser Arg 

80 

Ala Tyr Ser Leu 
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85 90 95 

Asp Gly Xaa Arg Lys Phe Glu Phe lie Gin Asp Glu Ser Gly Gly Asp 

100 * 105 110 

Lys Glu Phe Leu Gly Asn Leu Asp Asn Asn Ser Leu Lys Val Asn Met 
115 12 0 12 5 

Phe Asn Pro Thr Leu Glu Ala Gin Tyr lie Arg Leu Tyr Pro Val Ser 
130 135 " 140 

Cys His Arg Gly Cys Thr Leu Arg Phe Glu Leu Leu Gly Cys Glu Leu 
145 " 150 155 160 

His Gly Cys Leu Glu Pro Leu Gly Leu Lys Asn Asn Thr lie Pro Asp 

165 170 175 

Ser Gin Met Ser Ala Ser Ser Ser Tyr Lys Thr Trp Asn Leu Arg Ala 

180 185 190 

Phe Gly Trp Tyr Pro His Leu Gly Arg Leu Asp Asn Gin Gly Lys lie 
195 ~ 2 00 2 05 

Asn Ala Trp Thr Ala Gin Ser Asn Ser Ala Lys Glu Trp Leu Gin Val 
210 215 * 220 

Asp Leu Gly Thr Gin Arg Gin Val Thr Gly lie lie Thr Gin Gly Ala 
225 230 235 240 

Arg Asp Phe Gly His lie Gin Tyr Val Glu Ser Tyr Lys Val Ala His 

245 ' 250 * ~ 255 

-v 

Ser Asp Asp Gly Val Gin Trp Thr Val Tyr Xaa Xaa Glu Glu Gin Gly 

260 265 ~ 270 

Ser Ser Lys Val Phe Gin Gly Asn Leu Asp Asn Asn Ser His Lys Lys 
275 " 280 ^ 285 

Asn lie Phe Glu Lys Pro Phe Met Ala Arg Tyr Val Arg Val Leu Pro 
290 295 300 

Val Ser Trp His Asn Arg lie Thr Leu Arg Leu Glu Leu Leu Gly Cys 
305 ' 310 ^ 315 ~ 320 



INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Cys Ser Gly Pro Leu Gly lie Glu Gly Gly lie lie Ser Asn Gin Gin 
15 10 15 
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lie Thr Ala Ser Ser Thr His Arg Ala Leu Phe Gly Leu Gin Lys Trp 

20 25 30 

Tyr Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu lie Asn Ala Trp 
35 " ~ 40 " 45 

Thr Ala Ala Glu Asn Asp Arg Trp Asn Arg Trp lie Gin lie Asn Leu 
50 55 ~ " 60 

Gin Arg Lys Met Arg Val Thr Gly Val lie Thr Gin Gly Ala Lys Arg 
65 70 75 80 

lie Gly Ser Pro Glu Tyr lie Lys Phe Tyr Lys lie Ala Tyr Ser Asn 

85 90 95 

Asp Gly Lys Thr Trp Ala Met Tyr Lys Val Lys Gly Thr Asn Glu Asp 

100 105 110 

Met Val Phe Arg Gly Asn lie Asp Asn Asn Thr Pro Tyr Ala Asn Ser 
115 120 125 

Phe Thr Pro Pro lie Lys Ala Gin Tyr Val Arg Leu Tyr Pro Gin Val 
130 135 140 

Cys Arg Arg His Cys Thr Leu Arg Met Glu Leu Leu Gly Cys Glu Leu 
145 " 150 " 155 "* 160 

Ser Gly Cys Ser Glu Pro Leu Gly Met Lys Ser Gly His lie Gin Asp 

165 ~ 170 175 

Tyr Gin lie Thr Ala Ser Ser lie Phe Arg Thr Leu Asn Met Asp Met 

180 185 190 

Phe Thr Trp Glu Pro Arg Lys Ala Arg Leu Asp Lys Gin Gly Lys Val 
195 2 00 " 2 05 

Asn Ala Trp Thr Ser Gly His Asn Asp Gin Ser Gin Trp Leu Gin Val 
210 "' 215 ^ 220 

Xaa Leu Leu Val Pro Thr Lys Val Thr Gly lie lie Thr Gin Gly Ala 
225 230 235 ' 240 

Lys Asp Xaa Gly His Val Gin Phe Val Gly Ser Tyr Lys Leu Ala Tyr 

245 250 ^ 255 

Ser Asn Asp Gly Glu His Trp Thr Val Xaa Gin Asp Glu Lys Gin Arg 

260 265 270 

Lys Asp Lys Val Xaa Gin Gly Asn Phe Asp Asn Asp Thr His Arg Lys 
275 280 285 

Asn Val lie Asp Pro Pro lie Tyr Ala Arg His lie Arg lie Leu Pro 
290 295 " 300 

Trp Ser Trp Tyr Gly Arg lie Thr Leu Ala Ser Glu Leu Leu Gly Cys 
305 310 315 320 

Thr 



INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS 
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(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Lys Arg Ser Val Ala Val Trp Leu Leu Val Gly Leu Ser Leu Gly 
1 5 10 15 

Val Pro Gin Phe Gly Lys Gly Asp lie 

20 25 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Cys Asp Pro Asn Pro Cys Glu Asn Gly Gly lie Cys Leu Pro Gly Leu 
1 5 10 15 

Ala Val Gly Xaa Xaa Xaa Xaa Xaa Ser Phe Ser Cys Glu Cys Pro Asp 

20 25 30 

Gly Phe Thr Asp Pro Asn Cys Ser Ser Val Val Glu Val Ala Ser Asp 
35 40 45 

Glu Glu Glu Pro Thr Ser Ala Gly Pro 
50 55 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 24 : 

Cys Thr Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu lie Ser Glu 
1 5 10 15 



-89- 



i. 



t ;: 



f3 



Ala Tyr Arg Gly Asp Thr Phe lie Gly Tyr Val Cys Lys Cys Pro Arg 

2 0 2 5 30 

Gly Phe Asn Gly lie His Cys Gin His Asn fie 
35 " 40 

(2) INFORMATION FOR SEQ ID NO: 25; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Cys Glu Val Glu Pro Cys Lys Asn Gly Gly lie Cys Thr Asp Leu Val 
1 5 10 15 

Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn Tyr Ser Cys Glu Cys Pro Gly 

20 25 ' 30 

Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys 
35 40 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY; unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Cys Xaa Xaa Xaa Pro Cys Xaa Asn Gly Gly Xaa Cys Xaa Xaa Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Cys Xaa Cys Xaa Xaa 

20 25 30 

Gly Tyr Xaa Gly Xaa Xaa Cys Xaa 
35 40 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



-90- 



e 



3=25 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..309 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

NGTGATATTT GTGATCCCAA TCCATGTGAA AATGGAGGTA TCTGTTTGCC AGGATTGGCT 60 

GTAGGTTCCT TTTCCTGTGA GTGTCCAGAT GGCTTCACAG ACCCCAACTG TTCTAGTGTT 12 0 

GTGGAGGTTG GTCCCTGCAC TCCTAATCCA TGCCATAATG GAGGAACCTG TGAAATAAGT 18 0 

GAAGCATACC GAGGGGATAC ATTCATAGGC TATGTTTGTA AATGTCCCCG AGGATTTAAT 24 0 

GGGATTCACT GTCAGCACAA CATAAATGAA TGCGAAGTTG AGCCTTGCAA AAATGGTGGA 3 00 

ATATGTACAG 310 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 30 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ix) FEATURE : 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 550... 1212 
(D) OTHER INFORMATION: 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 28: 






GAATTCCGGG 


AGGGAGGGTA 


GGGGGGCGGG 


CCGCGGGGGC 


CCAAAGCCAG 


CTAGGCTCAG 


60 


TCTCACACGC 


GCGCCGCCAC 


TGTTTGTATA 


TAGTGCGCTC 


CTGGCCTCAG 


GCTCGCTCCC 


120 


CTCCAGCTCT 


CGCTTCATTG 


TTCTCCAAGT 


CAGAAGCCCC 


CGCATCCGCC 


GCGCAGCAGC 


180 


GTGAGCCGTA 


GTCACTGCTG 


GCCGCTTCGC 


CTGCGTGCGC 


GCACGGAAAT 


CGGGGAGCCA 


240 


GGAACCCAAG 


GAGCCGCCGT 


CCGCCCGCTG 


TGCCTCTGCT 


AGACCACTCG 


CAGCCCCAGC 


300 


CTCTCTCAAG 


CGCACCCACC 


TCCGCGCACC 


CCAGCTCAGG 


CGAAGCTGGA 


GTGAGGGTGA 


360 


ATCACCCTTT 


CTCTAGGGCC 


ACCACTCTTT 


TATCGCCCTT 


CCCAAGATTT 


GAGAAGCGCT 


420 


GCGGGAGGAA 


AGACGTCCTC 


TTGATCTCTG 


ACAGGGCGGG 


GTTTACTGCT 


GTCCTGCAGG 


480 


CGCGCCTCGC 


CTACTGTGCC 


CTCCGCTACG 


ACCCCGGACC 


AGCCCAGGTC 


ACGTCCGTGA 


540 



GAAGGGATC ATG AAG CAC TTG GTA GCA GCC TGG CTT TTG GTT GGA CTC AGC 591 

Met Lys His Leu Val Ala Ala Trp Leu Leu Val Gly Leu Ser 
1 5 10 

CTC GGG GTG CCC CAG TTC GGC AAA GGT GAC ATT TGC AAC CCG AAC CCC 63 9 

Leu Gly Val Pro Gin Phe Gly Lys Gly Asp lie Cys Asn Pro Asn Pro 
15 2 0 2 5 3 0 



-91- 



$ .....J 



TGT GAA AAT GGT GGC ATC TGT CTG TCA GGA CTG GCT GAT GAT TCC TTT 687 
Cys Glu Asn Gly Gly He Cys Leu Ser Gly Leu Ala Asp Asp Ser Phe 

35 40 '* 45 

TCC TGT GAG TGT CCA GAA GGC TTC GCA GGT CCG \AAC TGC TCT AGT GTT 
Ser Cys Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys Ser Ser Val 

50 55 60 



735 



GTG GAG GTT GCA TCA GAT GAA GAA AAG CCT ACT TCA GCA GGT CCC TGC 783 
Val Glu Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala Gly Pro Cys 
65 70 75 

ATC CCT AAC CCA TGC CAT AAC GGA GGA ACC TGT GAG ATA AGC GAA GCC 831 
He Pro Asn Pro Cys His Asn Gly Gly Thr Cys Glu He Ser Glu Ala 
8 0 85 90 

TAT CGA GGA GAC ACA TTC ATA GGC TAT GTT TGT AAA TGT CCT CGG GGA" 8 79 

Tyr Arg Gly Asp Thr Phe He Gly Tyr Val Cys Lys Cys Pro Arg Gly 
95 100 105 no 



TTT AAT GGG ATT CAC TGT CAG CAC AAT ATA AAT GAA TGT GAA GCT GAG 92 7 

Phe Asn Gly He His Cys Gin His Asn lie Asn Glu Cys Glu Ala Glu 

PJ 115 120 " 125 

M CCT TQC AGA ^ T GQC GGA GAG ^ c TAC TCT 97 5 

:j Pro Cys Arg Asn Gly Gly He Cys Thr Asp Leu Val Ala Asn Tyr Ser 

iff 130 13 5 14 0 



TGT GAA TGC CCA GGA GAA TTT ATG GGA CGA AAT TGT CAA TAT AAA TGC 1023 
Cys Glu Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys Cys 
145 150 ~ J 155 

TCT GGG CAC TTG GGA ATC GAA GGT GGG ATC ATA TCT AAT CAG CAA ATC 10 71 

Ser Gly His Leu Gly He Glu Gly Gly He He Ser Asn Gin Gin He 
160 165 170 

ACA GCT TCA TCT AAT CAC CGA GCT CTT TTT GGA CTC CAG AAG TGG TAT 1119 
Thr Ala Ser Ser Asn His Arg Ala Leu Phe Gly Leu Gin Lys Trp Tyr 
175 180 185 190 

CCC TAC TAT GCT AGA CTT AAT AAG AAG GGC CTT ATA AAT GCC TGG ACA 1167 
Pro Tyr Tyr Ala Arg Leu Asn Lys Lys Gly Leu He Asn Ala Trp Thr 

195 200 205 

GCT GCT GAA AAT GAC AGA TGG CCA TGG ATT CAG GTA ACA GTG GGA TGAGA 1217 
Ala Ala Glu Asn Asp Arg Trp Pro Trp He Gin Val Thr Val Gly 

21° 215 220 



CAAATCCATT 


TCCCAAATTA 


TCAGAATCAT 


TATAGAAGTA 


GGTTAGGGAG 


AATTGGCTGT 


1277 


GATTCTTTCT 


CATGGTTAAA 


ATGTGATTTA 


GTTCAGAATT 


AACATGGTTG 


GAAACTCTAA 


1337 


AAAATGTGGA 


AAACAGGAAC 


ATTCTATGTC 


TGAAAATCTG 


AAAATAGCAT 


CAAGATGAAA 


1397 


ACATTCTTTA 


GTCATAAATA 


TACTCTTTTA 


AGTTATAGTA 


GAGAAAAAGA 


TCTTATCATT 


1457 


T C AT AAGTGG 


ACTTTTGGGA 


TAGCATTGGA 


AATGTAAATG 


AAATAAATAC 


CTAATTGAAA 


1517 


AAAGTTTATT 


CTAAAGTGTT 


AATATTTAGC 


AACAGATTCA 


GAGACAAGAA 


AGTAACAATT 


1577 


CAATCTGTGT 


ATTTTTTGTG 


AGAAATAGTT 


TCCCATGTGC 


AAATATAAAG 


TGCGCATCAT 


1637 


ATCATGATAA 


TATCCAACTG 


TCTGCAGAAC 


TCCCTTTCAT 


AAATGAGAGA 


ATTTTAATTC 


1697 



-92- 



* 



t 



ATAGTGCCTT 


ATATCCTCAT 


CAGCCATCTG 


ACTTTACTAC 


AGAAGAAAAC 


AATGAAATGA 


1757 


TGCATTAAGT 


GCTTTGCTAG 


AAGAAACATC 


ATAGCAAAGC 


TGATAGCCCA 


CATTCTGTGC 


1817 


ANNNAAGCTT 


CCAGAGCACT 


CGAGAAAAAG 


CAGAAATGAG 


1 ATGTTTTATG 


AAAACCGAAA 


1877 


AGATAATCTG 


ATTTCTGTGA 


AATATACTTT 


TGATCATGTG 


GTTCTTTAAG 


ATAGTCACTA 


1937 


APAARTfATT 


AC5TAC4PAC5AT 


AfPAAATRnO 


AfiAAAATTTf* 


C AC4TATA CTV? 






CAGTCATGCT 


GAAACTACAT 


GAGGTCAGGA 


AAGTTTTGAA 


ATAAGGTGAT 


TTTGGAAGGA 


2057 


TACCTTCAAC 


TGGCCTAGAT 


TTTCAAGAAA 


CAGTGTAATC 


AACAGCCAAA 


CATGAGAATC 


2117 


TAGCTAACAG 


CATTTAGAAA 


ACCAGAACTA 


AGAGTGTTAC 


TGGGGAATTG 


CATTTAAATC 


2177 


CAGTATGAGA 


GTTTGCAAAT 


GCCGTATTCT 


TCTAAGGGGT 


TTGTGCCACA 


TTTTGTTACC 


2237 


ATGGAGTCCT 


CTGTAAGAAC 


TTTATTAGAT 


AAATCATCTT 


TACACTATAA 


TTTGAATAAA 


2297 


AGCCGGAATT 


C 










2308 



(2) INFORMATION FOR SEQ ID NO; 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 221 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Lys His Leu Val Ala Ala Trp Leu Leu Val Gly Leu Ser Leu Gly 
1 5 10 ' 15 

Val Pro Gin Phe Gly Lys Gly Asp lie Cys Asn Pro Asn Pro Cys Glu 

20 25 30 

Asn Gly Gly lie Cys Leu Ser Gly Leu Ala Asp Asp Ser Phe Ser Cys 
35 40 45 

Glu Cys Pro Glu Gly Phe Ala Gly Pro Asn Cys Ser Ser Val Val Glu 
50 55 60 

Val Ala Ser Asp Glu Glu Lys Pro Thr Ser Ala Gly Pro Cys He Pro 
65 70 75 * 80 

Asn Pro Cys His Asn Gly Gly Thr Cys Glu He Ser Glu Ala Tyr Arg 

85 90 95 

Gly Asp Thr Phe He Gly Tyr Val Cys Lys Cys Pro Arg Gly Phe Asn 

100 105 ~ 110 

Gly He His Cys Gin His Asn He Asn Glu Cys Glu Ala Glu Pro Cys 
115 120 ^ 125 

Arg Asn Gly Gly He Cys Thr Asp Leu Val Ala Asn Tyr Ser Cys Glu 
130 135 140 

Cys Pro Gly Glu Phe Met Gly Arg Asn Cys Gin Tyr Lys Cys Ser Gly 
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145 

His Leu Gly lie 



Ser Ser Asn His 

180 

Tyr Ala Ala Leu 
195 

Glu Asn Asp Arg 
210 



150 

Glu Gly Gly lie 
165 

Arg Ala Leu Phe 



Asn Lys Lys Gly 

200 

Trp Pro Trp lie 
215 



155 

lie Ser Asn Gin 
170 

Gly Leu Gin Lys 
185 

Leu lie Asn Ala 



Gin Val Thr Val 

220 



160 

Gin lie Thr Ala 
175 

Trp Tyr Pro Tyr 
190 

Trp Thr Ala Ala 
205 

Gly 



-94- 



I 

I 



t 

t 



0" 



r :: 



! -jt; 



Xtl 



Sac 




EcbR 



39 C i j 
Y 



! EcoR! 

i 

i 



EcoR 



EcoR[ 



Sac 



Sac I 



Sac [ 



Sac 1 
{ Sac 



-/ 



v 



8 kb deletion 
, ( 2 /./ Z Z Z Z Z Z21 - 

10-20 copies 
transgene 



42 kb 



X clones 



zoo blot-r 
exon trap 

SLM275 
integration 
site clones 



t 




v * ^ 



m-dell 

h-MFG 

h-TV. 

in— FVXXX 
X-A5x>l 
X-A5b2 - 
dis-I 

Consensus 



n;- dell 

h-MFG 

h-FV 

X-A5bl 
X-A5b2 
dis-X 

Consensus 



Bj^LVPTXs/FG 
DLGSSXSvZEG 

DLFNLRF 
M 




>£§3 — A? 



G 



J3LAEEXI 
H 




GSVQr^35S^A^1S£^SA^2 



-iZSG 



— - 




S3 . Y 



I2F 




HCVfiQDEKQR 
?EYQD?RTG 
CPYHLXSSM 
P — QILYNG 
X — TLKDGN* 
MIMDSSKN 
F FY?, 




^G®^3^X©^^pA^rBD T2SE 



HAPSXAX H?LT- 



1 



10 20 30 40 50 60 

1234557890 1234567890 12345S7390 1234567890 1234567890 1234567890 



ScoRI Hp a I 

GAATTCCGGT TAACTGAGGA C AAAGQG T AA TG CAG AAG TG ATATTTGATT TCCATTCTCA 



)3 TTCCCAGTGG CCTTGATATT TAAACTGATT CCTGCCACCA GGTCCTTGGG CCACCCTGTC 



-sp3i Sphl 
CCTGCGTCTC ATATTTCTGC ATGC7GCTTT GT7TGTATAT AGTGCGCTCC TGGCCTCAGG 



ctcgctcccc tccagctctc gcttcattgt tctccaagtc AGAAGCCCCC GCATCCGCCG 



o5Sr.ii 



GGGGAGCCAG GAACCCAAGG AGCCGCCGTC CGCCCGCTGT GCCTCTGCTA GACCACTCGC 



TGAGGGTGAA T'GA CTCC^"^~ rr, C - GG^CC - cC- , C r ^ r ^ r ^"^'* — - "~CG , ° , '~ r " 7r ^C - , — ■, 



o 

( 



3' 



rss: 



10 20 30 40 5Q 50 

1234557890 1234557890 1234567390 1234567890 1234557890 1234567890 



Eco47lII 



Aatll 



AGAAGCGCTG CGGGAGGAAA GACGTCCTCT TGATCTCTGA CL\GGGCGGGG TTTACT 



5*4 



BssHII 
PstI 

TCCTGCAGGC GCGCCTCGCC TACTGTGCCC TCCGCTACGA CCCCGGACCA G C C CAGGTCA 



50 



s : : 
rsFss 



BspHI 

CGTCCGTGAG AAGGGATCAT .GAAGCACTTG GTAGCAGCCT GG CTTTTGG T 

M K H L V A A V/ L L V 



G L S 



rjs.a 



CTCGGGGTGC CCCAGTTCGG CAAAGGTGAC ATTTG CAAC C CGAACCCCTG TGAAAATGGT 
L G V ? Q F G K G D I C N ? N ? C S N G 



72 



3soHI 



GGCATCTGTC TGTCAG 
G 1 C L S G 



.CT GGCTGATGAT TCCTTTTCCT GTGAGTGTCC AGAAGGCTTC 



78 



Li 



D D 



2 C 



GCAGGTCCGA ACTGCTCTAG TGTTGTGGAG ^TTGCATCAG ATG AAG AAAA GG 



N 



V V S 



84 



V 



D 



\* •■ — — — — . , 

f« \j 1 v_ x £> r, A 



N ? C H 



''* R G D 



.CATTCAT AGGC7ATG7T TG' 
FIG V V C 



■p \ > 1 r—\f~*r—if-\ r-vi — s~, ^ si > r-^ 
- "—"*>. .Civ. - \ — ^T\JV3'»3 -TV i. 

K C ? ?„ G 



\ 



I 

I 



10 20 30 40 50 50 

1234557890 12345S7S9Q 1234557390 12345S7390 1234557390 1234567890 



CACTGTCAGC ACAATATAAA TGAATGTGAA GCTGAGCCTT GCA.GAAATGG CGGAATATGT 1020 
H C Q H N I N S C E A S ? C ?, N G G I C 



- b ^_ i lb 

D L V 



CTAACTA CTCTTGTGAA TGCCCAGGAG AATTTATGG; 
NY S C E C ? G E F K G 



AC G AAATTGT 1030 
R N r C 



CAATATAAAT G CTCTGGGC A CTTGGGAA7C G.AAGGTGGGA TGATATCTAA TCAGCAAAT 
Q V K C S G H L G I E G G I I S N Q Q X 



1140 



Sad 
Eel 13 oil 



Em! 4 



ACAGGTTCAT 
T A S S 



TAATCACCG . A.GCTCTTTT 



N 



o 



GGACTCCAGA AGTGGTATCC CTACTATGCT 
G L Q K W V ? Y Y A 



12 0 0 



R\rul I 



Ncol 
MscI 
Ball 



CG ACTT AAT A AGAAGGGCCT TATAAA7GCC TGGACA.GCTG CTGAAAATGA C AG ATGGCCA 12 60 



4 



G 



XT 



D 



R W 



1GGATTCAGA T AAATTTGC A AAGAAAAATG AGAGTCACTG GTGTT ATT AC CCAAGGAGCA 1320 
W I Q I N L Q R :< M R V 7 G V IT Q G A 



2. 2. 2. 2. ^ ^ 2, ' ' T 1 0 ^ 2. i A f~* ^ ' — 1 > x T 1 x "" > rp f~t r-> r~i > /"■» v > > •» (—irj-< »-■'--'— 



V 



V 



f-i -v X /— > -\ > <T-v-> v r-» r~- • 

S N D G 



13S0 



— >*J i> _ 



— — CZ 2- 

:< "'t 



i obVj 



:ajatgta.caa agtaaaagc-c accaatgaag AGATGGT? 

M Y K V KG T 



>? 

, 1 



i\-b: c-o.*iAr 

R G N 



1440 



I 

1 



i - ? 



10 20 30 40 50 50 

1234557890 1234557890 1234557890 1234557890 1234557390 1234557890 



Ndel 

GTTGATAACA ACACACCATA TGCTAATTCT TTCACACCCC CAAT CAAAGC TCAGTATGTA 150 C 



V D N N 



v 



N S 



r 



Q 



v 



V 



AGACTCTACC CCCAAATTTG TCGAAGGCAT TGTACTTTAA GAATGGAACT TCTTGGCTGT 1550 



Q 1 



Z3 p 



C T 



v 



L G C 



• fits 
i 



Sad 
Ecll35ll 

GAGCTCTCAG GCTGTTCAGA ACCTTTGGGG ATG AAAT C AG GGCAT AT AC A AG ACT AC GAG 1520 



E L S G 



S F 



S G 



I 0 



D 



3bsX 

ATCACTGCCT CCAGCGTCTT CAGAACACTC AACATGGA.CA. TGTT7ACTTG GG AACCAAGG 
1 T A S S V F ?, T L N M DM FT w F ? ?, 



loo 



AAA G C C A G G C TGGA.CAAGCA AGGCAAAGTA AATGC CTGG A CTTCCGGCCA TAACGACCAG 
:< A ?. L D K Q G K V N A W T S G H N* D Q 



1743 



TCACAATGGT TACAGGTTGA TCTTCTTGTC CCTACTAAGG TGACAGGCAT C .ATT ACAC AA 13Q0 



Q 



V 



V 



."V 



V 



Q 



GGAGCTAAAG ATTTTGGTCA CGTGCA 
G A K D F 



/ — 1 rj"*rp f~> ^» x r— t > > > >, -» > ^ r—> t^-\ > ^ 



G 



iooO 



AO 2LI 



3 G F K W M V ~ Q 0 FKQF K D K V F O 



O " 



i920 



I 



1 



• -> r— 

~\ £_ o — 

0 



10 20 30 40 50 60 

1234557890 1234567890 1234557590 1234557390 1234557S90 1234557890 



GGCAATTTTG ACAATGACAC T CAGAG3 AAA AATGTC ATCG ACCCTCCCAT CTATGCACGA 19 3 



G is 



D N D 



■ * * s 



D 



A R 



TTCATAAGAA TCCTTCCTTG GTCCTGGTAT GG AAGGATCA CTCTGCGGTC AGAGCTGCTG 204 



hH «"»» 

HI 



> i 



:Spi 

GGCTGCGCAG AGGAGGAATG AAGTG CGGGG CCGCACATCC CACA-.TGCTT TTCTTTATT? 210 

tea _ _ _ 

is G C A H H 3 



T C C T AT AASt ATCTCCACGA AATG A-.CTGT GTGAAGCTGA TGGAA-.CTGC ATTTGTTTTT 21 



TTCAAAGTGT T CAAATT ATG GTAGGCT ACT GACTGTCTTT TTAGGAGTTC TAAGCTTGCC 222 



TTTTT AAT AA TTTAATTTGG TTTCCTTTGC TCAACTCTCT TATGTAATAT C AC ACTGTCT 22S 



230 



» 



<q 
I 



X 



<*. a*) 



'3 



9 13 27 55 45 54 

-> i vi a i. A\j iUA O — ^J,u J.^\i v_v_^- 1 1_ i LCr*. Ao.-. .-.i, 1 iXs 1. XT A A>_A AAG CLrG i 



s 


Lr 




5 


p 


L 


S 


? o 


3 


-■\ J. 


C 


T 


■ 


;< 








AAG 


53 

AGA 






72 
TCT 


TGA 


ol 

ATT CTT 




TAG GGG 


CGG 


ACT 


CTG 


CTG 


r 
i 

CTG' 


108 
CCC 




:< 






s 


S 




J. Lj 




x u 


-=> 

J. s 


S 








p 


i— v** /■"* 
1 


GCT 


117 
GCC 




TCG 


125 
GCT 


-» * > — . » 


135 
CTG CCC 


TCC 


144 
- — . . — \ r+ 


ACC 


CCT 


iD3 
Gr.'w 


CAG 


CCG 


152 
GGG 



AT S A T L ? S A T T ? D Q ? G 

171 120 189 193 207 215 

-/-* 



TO. CGT CCG GGA GAC GGG ATC ATG AAG CGC TCG GTA GCC GTC TGG CTC TTG GTV, 

1 • — ■ 

s ; 

U V "3 

i t 



D G Z Y. :< Pv 5 v A v r, 



r * 225 234 243 252 251 270 

Ui GGG CTC AGC CTC GGT GTC CCC CAG TTC GGC AAA GGT G-T ATT TGT GAT CCC AAT 



G 


Li 


s 


L 


G 


V 


p 


Q 




G 




G 


D 




C 


D 


"3 








279 




GGA 








297 
TTG 


CCA 


GGA 


a 

_>Qo 




GTA 


f>r^- — i 






324 




TGT 


GAA 


AAT 


GGT 


TGT 




GCT 








C 




N 


G 


G 


— -» 

l 


C 


L 


p 


G 








G 










/- \ ,-~» 


333 


CCA 




342 


TTC 




351 






350 




AGT 






GAG 




TGT 


TGT 


-» ' — t 


GGC 


r if 






— x_ - 


GTT 


— r/" 


1 « M 



^ — 




C 


p 


D 


G 




T D 


p 












V 




V 




C-CA 


237 
GAT 


GAA 


GAA 


396 
GAA 




405 
. ~ ■ — — — « 


GCA 




414 
CCC 


- v^"«». 


ACT 


423 
CCT 




CCA 






< — • 


D 




* — < 




p 


T S 


■* 


G 


p 




T 


p 


\: 


p 




CAT 




441 
GGA 


GGA 


ACC 


4 50 
TGT 


GAP-. 


4^^ 
ATA AGT 


GAA 


GCA 


453 
TAG 


< — "> 


*^ ^ 

•w ^ 


477 
GAT- 


-\ >. 


TTC 


435 



C E I S Z A V ?. G D T F 1 
504 513 522 531 540 



4V3 

GTT TGT AAA TGT CCC GGA GGA TTT AAT GGG ATT CAC TGT 



\ 1 — • - — • — • (->■»<'•» ^-vr^ 
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TO 



3D 



557 



J /3 



— *\ ^ > > •* 1 — v — v 



o 




D 



AAC 



T A j. JA.'^ 



512 
TGT 




• 5 -~i 
0 J v 

Qi ^ — ■< — p ^ ■ 



— 1 *i , - J - 
^ -J ^> 



gga a: 



543 



NY SCECPGEFHG ?. N T C Q 



< 

r 



. c. 



657 555 575 534 533 702 

TXC AAA TGG TO. GGC CCA CTG GCA ATT CAA GGT GGA ATT ATA TCA AAC CAG CAA 



V 


r\ 


C 


5 


G 


P 


1j 


G 


X 




/—I 

Vj 




T 


1 


s 




Q 


Q 






711 






720 


CAC 


CC-. 


729 


C 1 




732 






747 


TGG 




755 






GCT 


— ^ *v— . 


TCT 


ACT 






GGA 


CTC 


CAA 


AAA 


TAT 


CCC 




T 


A 




S 






— * 


> 
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G 


L 


Q 


;< 


W 


Y 
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TAT 


765 
GCA 


rep 


CTT 


774 
AAT 


AAG 


xiAG 


783 
GGG 


CTT 




792 
A AT 


GCG 


tg£ 


SOj 
ACA 


GCT 


CCA 


SIC 

— - * » 


V 
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T 


N 


A 


w 
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mm 


AAT 


GAC 


219 
AGA 


1 ■*« -4 

— - — 1N<rf4 


AAC 


G2S 


TGG 


2, TV 


337 
CAG 


AM 


nAT 
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TTG 
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AGA 


855 
AAA 


ATG 


AGA 


354 
GlV 


N 


D 


R 




N 


O 


W 




Q 


> 


N 




Q 


^* 


— 
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R 


V 


ACT 


GGT 


S73 
GTG 


-ATT 


ACC 


So2 






AAG 


> rv-* 




900 

Gvr.-\ 


> <—»/-» 


CCA 
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GAG 


TV? 

-* - ■* 
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G 
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T 
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G 
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G 
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-N. 
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927 335 $45 954 953 972 

CAC AAA ATT GCC TAG ACT AAT CAT GGA AAG ACT TGG GCA ATG TAC AAA CTC 
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GAA 


990 


ATG 




./ _/ _/ 
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GCA AAC 




CAT 
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1025 

ACT CCA 
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GCT 


1035 
1 A-AC 
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TTC 


.044 
AHA 


CCC 


f 
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.053 
ATA 




1052 
'GCT CAG 




GTA 


1071 


CTC 


1030 
TAT CCC 
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1125 
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1134 
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! TCT 




.AGA 


CAT 




ACT 


TTC 


CTT 
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1143 1152 1151 1170 1173 118S 

TCG GGT TCT TCT GAG CCT CTC GGT ATG AAA TCA GGA CAT ATA CAA GAC TAT CAG 



C SEPLGMKSGHXQO 
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may jeopardize the validity of the application or any patent issuing thereon. 



SIGNATURE OF (NvWlOR 16 1 

Thomas Quertermous 


SIGNATURE OF INVENTOR MS ' 

Brigid Hogan 


TI. Ralph Snodgrass 


DATE 


DATE 




SIGNATURE OF INVENTOR 204 


SIGNATURE Of INVENTOR 205 


SIGNATURE OF INVENTOR 206 


Thomas Joel Zupanorc 








DATE 


DATE 
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DECLARATION 
AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below at 201 et seq. underneath my name. 

I believe I am the original, first and sole inventor if only one name is listed at 201 below, or an original, first and joint inventor if plural names 
are listed at 201 et seq. below, of the subject matter which is claimed and for which a patent is sought on the invention entitled 

DEVELOPMENTAL!. Y-REGULATED ENDOTHELIAL CELL LOCUS-1 
and for which a patent application: 

□ is attached hereto and includes amendment(s) filed On (if applicable) 

E3 was filed in the United States on June 5, 1996 as Application Serial No. 
with amendment(s) filed on (ifappUcabu) 

□ was filed as PCT international application Serial No. on 

On (tf applicable) 

I hereby state that I have reviewed and understand the contents of the above identified application, including the claims as amended by any 
amendment referred to above. ' 

I acknowledge the duty to disclose information known to me to be material to patentability as defined in Title 37, Code of Federal Regulations, 
§1.56. to 

1 nereb y cIaim foreign priority benefits under Title 35, United States Code, §119(a)-(d) of any foreign application(s) for patent or inventor's 

j certificate listed below and have also identified below any foreign application for patent or inventor's certificate having a filing date before that 
i] of the application on which priority is claimed: 



08/659.235 (for declaration not accompanying application) 

and was amended under PCT Article 19 



EARLIEST FOREIGN APPLICATION(S), IF ANY, FILED PRIOR TO THE FILING DATE OF THE APPLICATION 


APPLICATION NUMBER 


COUNTRY 


DATE OF FILING 
(day, month, year) 


PRIORITY 
CLAIMED 








YES □ NO □ 








YES □ NO □ 



|* I hereby claim the benefit under Title 35, United States Code, § 119(e) of any United States provisional application(s) listed below. 



APPLICATION NUMBER 


FILING DATE 











I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) listed below and insofar as the subject 
matter of each of the claims of this application is not disclosed in the prior United States application in the manner provided by the first 
paragraph of Title 35, United States Code §112, I acknowledge the duty to disclose information which is material to patentability as defined 
in Title 37, Code of Federal Regulations, §1.56 which became available between the filing date of the prior application and the national or PCT 
international filing date of this application: 



APPLICATION SERIAL NO. 


FILING DATE 


STATUS 


PATENTED 


PENDING 


ABANDONED 


08/480,229 


June 7, 1995 




X 















POWER OF ATTORNEY: As a named inventor, I hereby appoint S. Leslie M.srock (Reg. No. 18872), Harry C. Jones, III (Reg No 20280) Beri A Terzian 
(Reg. No 20060), Gerald J. Flimoft (Reg No. 20823), David Weild. Ill (Reg. No. 21094), Jonathan A. Marshall (Reg! No 24614) Barry D Re^Reg No 
No ^ 22795Tp 0 hi,7n ^ Zutl V^ "nZ^ Isaac c J " k ° v sky (Reg No. 22713), Joseph V. Colaiann, (Reg. No. 20019). Charle^E. £ 
No. 22795), Phi hp T Shannon (Reg. No. 24278), Francis E. Morns (Reg. No. 24615), Charles E. Muler (Reg. No. 24576), Gidon D. Stern (Reg No 27469) 
John J. Lauter, Jr. (Reg. No. 27814), Brian M. Poissant (Reg. No. 28462), Brian D. Coggio (Reg No 27624) Rorv J Raddine <Re„ Nn X' 1. T 
J R Harbu.a k (Reg No. 29166), Donald J. Goodel, (Reg. No. .9766), James N. Palik (Reg.No. M5*K^E.S (Re nV 29258) llara" A Comzz 

ttKT^'«t f r 3) n R '. (Reg ' N °- 301 1 1) ' A " an A ' FanUCC1 < Re *- No - 30256 >. Geraidrnt F BaTdwTn^Reg No 
31232), Victor N. Balanc.a (Reg. No. 31231). Albert P. Halluin (Reg. No. 25227). Samuel B. Abrams (Reg. No. 30605) Steven I Wallach cRee No 3S4021 

Marcia H . Sundeen (Re£ No. 30893), Paul J. Zegger(Reg. No. 33821). Edmond R. Bannon (Reg. No. 32110), Bruce'j Barker (Reg No 33291) ' Adnane 

Nn a 3 2^). Ann L. Qtolfi, (Reg No 31956), SaraLynn Mandel (Reg. No. 31853) Mark A. Farley (Reg. No. 33170) James G Market (Reg 

mfm^w Tl ( S - "'.I ^ PCnnie & Edm ° ndS - Wh ° Se addreSSeS are 1155 Avenue Americas New Y^rk, New York 

10036 1667 K Street N.W Washington, DC 20006 and 2730 Sand Hill Road, Menlo Park, CA 94025, and each of them my attorneys to prosecute m/s 
application, and to transact all business in the Patent and Trademark Office connected therewith. anomeys. to prosecute tins 
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5END CORRESPONDENCE TO: PENNIE & EDMONDS 

1155 AVENUE OF THE AMERICAS 
NEW YORK, N.Y. 10036-2711 



DIRECT TELEPHONE CALLS TO: 
PENNIE & EDMONDS DOCKETING 
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I FULL NAME 
OF INVENTOR 


LAST NAME 

Quertermous 


FIRST NAME 

Thomas 


MIDDLE NAME 


RESIDENCE & 
CITIZENSHIP 


CITY 

Nashville 


STATE OR FOREIGN COUNTRY 

Tennessee 


COUNTRY OF CITIZENSHIP 

U.S.A. 


POST OFFICE 
ADDRESS 


STREET 

3417 Valley Brook Road 


CITY 

Nashville 


STATE OR COUNTRY 

Tennessee 


ZIP CODE 
37215 


2 
0 
2 


FULL NAME 
OF INVENTOR 


LAST NAME . 

Hogan 


FIRST NAME 

Brigid 


MIDDLE NAME 




RESIDENCE & 
CITIZENSHIP 


CITY 

Brentwood 


STATE OR FOREIGN COUNTRY 

Tennessee 


COUNTRY OF CITIZENSHIP 

United Kingdom 


POST OFFICE 
ADDRESS 


STREET 

1303 Robert E. Lee Lane 


CITY 

Brentwood 


STATE OR COUNTRY 

Tennessee 


ZIP CODE 

37027 


2 
0 
3 


FULL NAME 
OF INVENTOR 


LAST NAME 

Snodgrass 


FIRST NAME 

H. 


MIDDLE NAME 

Ralph - 




RESIDENCE & 
CITIZENSHIP 


CITY 

Powell 


STATE OR FOREIGN COUNTRY 

Ohio 


COUNTRY OF CITIZENSHIP 
U.S.A. 


POST OFFICE 
ADDRESS 


STREET 

650 Retreat Lane 


CITY 

Powell 


STATE OR COUNTRY 

Ohio 


ZIP CODE 

43065 
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0 
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FULL NAME 
OF INVENTOR 


LAST NAME 

Zupancic 


FIRST NAME 

Thomas 


MIDDLE NAME 

Joel 




RESIDENCE 8c 
CITIZENSHIP 


CITY 

Worthington 


STATE OR FOREIGN COUNTRY 

Ohio 


COUNTRY OF CITIZENSHIP 

U.S.A. 


POST OFFICE 
ADDRESS 


STREET 

501 Park Boulevard 


CITY 

Worthington 


STATE OR COUNTRY 

Ohio 


ZIP CODE 

43085 
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FULL NAME 
OF INVENTOR 


LAST NAME 


FIRST NAME 


MIDDLE NAME 




RESIDENCE & 
CITIZENSHIP 


CITY 


STATE OR FOREIGN COUNTRY 


COUNTRY OF CITIZENSHIP 


POST OFFICE 
ADDRESS 


STREET 


CITY 


STATE OR COUNTRY 


ZIP CODE 


2 
0 
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FULL NAME 
OF INVENTOR 


LAST NAME 


FIRST NAME 


MIDDLE NAME 




RESIDENCE & 
- CITIZENSHIP. 


CITY 


STATE OR FOREIGN COUNTRY 


COUNTRY OF CITIZENSHIP 


POST OFFICE 
ADDRESS 


STREET 


CITY 


STATE OR COUNTRY 


ZIP CODE - 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false statements 
may jeopardize the validity of the application or any patent issuing thereon. 



SIGNATURE OF INVENT6R iOl 

v. /[^v^^ iil .^j.^^zt 

Thomas Quertermous 


SIGNATURE 6F INVENTOR }tf> 

~Brigid I^an^"^^ 5 ^^" 


SIGNATURE OF INVENT6r ioi 

H. Ralph Snodgrass 


DATE " ' ' 

9-0 


DATE [t 


DATE 


SIGNATURE OF INVENTOR 204 

Thomas Joel Zupancic 


SIGNATURE OF INVENTOR 265 h 


SIGNATURE 6F INVENTOR 166 


DATE 


DATE 


DATE 
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